(54) Title: LENTIVIRAL VECTORS, RELATED REAGENTS, AND METHODS OF USE THEREOF



(57) Abstract: The present invention provides new lentiviral vectors, including lentiviral transfer plasmids and infectious lentiviral
particles. Lentiviral vectors of the invention were designed to offer a number of desirable features including reduced size, convenient
cloning sites (including multiple cloning sites and sites for particularly useful restriction enzymes), loxP sites, self-inactivating
LTRs, etc. Certain of the vectors are optimized for expression of reporter genes and/or for expression of siRNAs or shRNAs within
eukaryotic cells. The invention also provides three and four plasmid lentiviral expression systems. In addition, the invention provides
a variety of methods for using the vectors including gene silencing in cells and transgenic animals, and methods of treating disease.
LENTIVIRAL VECTORS, RELATED REAGENTS,
AND METHODS OF USE THEREOF 

Cross-Reference to Related Applications 
[0001] This application claims priority to U.S. Provisional Patent Applications
Ser. No. 60/408,558, filed September 6, 2002, Ser. No. 60/414,195, filed September
27, 2002, and Ser. No. 60/428,039, filed November 21, 2002. The contents of each of
these applications are incorporated herein by reference.

Background of the Invention

[0002] Viral vectors are efficient gene delivery tools in eukaryotic cells. Useful
viral vectors have been created from different virus families, including retroviruses.
Retroviruses have proven to be versatile and effective gene transfer vectors for a
variety of applications since they are easy to manipulate, typically do not induce a
strong anti-viral immune response, and are able to integrate into the genome of a host
cell, leading to stable gene expression. If provided with an appropriate envelope,
retroviruses can infect almost any type of cell. Due to these advantages a large
number of retroviral vectors have been developed for in vitro gene transfer. In
addition, use of retroviruses for purposes such as the creation of transgenic or
knockout animals, or for gene therapy, has been explored.

[0003] However, vectors based on simple retroviruses (e.g., oncoretroviruses)
have a number of disadvantages that limit their efficacy for such in vivo applications.
For example, vectors based on simple retroviruses are generally unable to integrate
into the genome of nondividing (postmitotic) cells. Furthermore, transgenes
expressed from simple retroviruses are subject to silencing during development (22).
To overcome these drawbacks, attention has recently focused on lentiviruses, a group
of complex retroviruses that includes the human immunodeficiency virus (HIV). In
addition to the major retroviral genes gag, pol, and env, lentiviruses typically include
additional genes that play regulatory or structural roles. Unlike simple retroviruses,
lentiviruses are able to integrate into the genome of non-dividing cells. Accordingly, a
variety of lentiviral vectors have been developed. However, existing lentiviral vectors
remain less than optimal from a number of perspectives. For example, existing
lentiviral vectors are typically large in size, poorly characterized, and lack various
features that facilitate cloning and uses of the vectors. Thus there remains a need in
the art for improved lentiviral vectors. The present invention addresses this need.

[0004] Rapid progress in technologies for sequencing genes and characterizing
their expression profiles has resulted in a growing list of coding regions within
mammalian genomes that are predicted to contribute to normal tissue function and to
the development of disease. Traditionally, establishing gene function has been
accomplished by gene targeting in mouse embryonic stem cells. While this
technology has been responsible for many key breakthroughs in our understanding of
the normal function as well as diseases of organs and tissues, it remains
time-consuming and expensive to perform. Furthermore, current gene targeting approaches
cannot be used to alter gene function in human tissues for the purposes of scientific
investigation or gene therapy. For these reasons, alternative approaches to inhibit gene
activity in primary cells and tissues have been explored.

[0005] Among the most promising of these new approaches is RNA interference
(RNAi), which has recently emerged as a rapid and efficient means to silence gene
function in eukaryotic (including mammalian) cells. As initially described in the
nematode C. elegans, RNAi involves introduction of double-stranded RNA (dsRNA)
into a cell, thereby inhibiting gene expression in a sequence-dependent fashion. More
recently it has been shown that shorter dsRNA species known as short interfering
RNAs (siRNA) can silence mammalian gene expression in a specific manner,
suggesting that RNAi can be used to study and manipulate gene function in higher
organisms as well. However, the use of RNAi in mammalian cells and organisms is
currently restricted by the limited delivery methods available. Accordingly, there is a
need in the art for improved reagents and methods that would facilitate the use of
RNAi in mammalian cells and organisms. The present invention addresses this need,
among others.



Summary of the Invention 

[0006] The present invention provides novel lentiviral vectors that offer a number
of features and advantages. In one aspect, the invention provides a lentiviral vector
comprising the following elements: a nucleic acid whose sequence includes (i) a
functional packaging signal; (ii) a multiple cloning site (MCS); and (iii) at least one
additional element selected from the group consisting of: a second MCS, a second
MCS into which a heterologous nucleic acid is inserted, a human immunodeficiency
virus (HIV) FLAP element, an expression-enhancing posttranscriptional regulatory
element, a target site for a site-specific recombinase, and a self-inactivating (SIN)
long terminal repeat (LTR). The lentiviral vector may be a lentiviral transfer plasmid
or an infectious lentiviral particle. In various embodiments of the invention the
expression-enhancing posttranscriptional regulatory element is a woodchuck hepatitis
virus regulatory element (WRE), and/or the target site is a loxP site. The invention
further provides collections of lentiviral plasmids possessing the features described
above.

[0007] In other aspects, the invention provides cells, including mammalian cells,
and transgenic animals that contain any of the inventive lentiviral vectors or
proviruses derived therefrom. The invention further provides methods for making
transgenic animals the cells of which comprise an inventive lentiviral vector or a
provirus derived therefrom.

[0008] The invention further provides a variety of lentiviral expression systems
comprising inventive lentiviral transfer plasmids. For example, the invention
provides a three-plasmid lentiviral expression system comprising: (a) a first plasmid
whose sequence comprises a nucleic acid sequence of at least part of a lentiviral
genome, wherein the plasmid (i) contains at least one defect in at least one gene
encoding a lentiviral structural protein, and (ii) lacks a functional packaging signal;
(b) a second plasmid whose sequence comprises a nucleic acid sequence of a virus,
wherein the plasmid (i) expresses a viral envelope protein, and (ii) lacks a functional
packaging signal; and (c) a third plasmid whose nucleic acid sequence includes (i) a
functional packaging signal; (ii) a multiple cloning site (MCS); and (iii) at least one
additional element selected from the group consisting of: a second MCS, a second
MCS into which a heterologous nucleic acid is inserted, an HIV FLAP element, an
expression-enhancing posttranscriptional regulatory element, a target site for a
site-specific recombinase, and a self-inactivating (SIN) LTR.
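
By way of illustration only, the division of functions among the three plasmids can be expressed as a small consistency check. The following Python sketch is not part of the invention; the feature labels, the Plasmid class, and the function name are hypothetical and were chosen solely for this example.

```python
from dataclasses import dataclass, field
from typing import Set

# Hypothetical labels for the "additional elements" listed for the third plasmid.
ADDITIONAL_ELEMENTS = {
    "second MCS",
    "second MCS with insert",
    "HIV FLAP",
    "posttranscriptional regulatory element",
    "recombinase target site",
    "SIN LTR",
}


@dataclass
class Plasmid:
    name: str
    features: Set[str] = field(default_factory=set)


def is_valid_three_plasmid_system(packaging: Plasmid,
                                  envelope: Plasmid,
                                  transfer: Plasmid) -> bool:
    """Check the feature constraints described for plasmids (a), (b), and (c)."""
    # (a) and (b) must both lack a functional packaging signal.
    helpers_ok = all("packaging signal" not in p.features
                     for p in (packaging, envelope))
    # (a) carries at least one defect in a structural gene.
    packaging_ok = "defective structural gene" in packaging.features
    # (b) expresses a viral envelope protein.
    envelope_ok = "env" in envelope.features
    # (c) has a packaging signal, an MCS, and at least one additional element.
    transfer_ok = ("packaging signal" in transfer.features
                   and "MCS" in transfer.features
                   and bool(transfer.features & ADDITIONAL_ELEMENTS))
    return helpers_ok and packaging_ok and envelope_ok and transfer_ok


packaging = Plasmid("packaging", {"gag", "pol", "defective structural gene"})
envelope = Plasmid("envelope", {"env"})
transfer = Plasmid("transfer", {"packaging signal", "MCS", "SIN LTR"})
print(is_valid_three_plasmid_system(packaging, envelope, transfer))
```

The check reflects the point that only the third (transfer) plasmid carries a functional packaging signal, so only its transcript is expected to be packaged into particles.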

[0009] The invention further provides a four plasmid lentiviral expression system,
in which three of the plasmids are as described immediately above and the fourth 
plasmid encodes the Rev protein. 

[0010] The invention provides methods of creating infectious lentiviral particles
and of creating producer cell lines that produce infectious lentiviral particles. The
lentiviral particles may, but need not be, derived from the lentiviral transfer plasmids
as described herein.

[0011] The invention further provides a method for introducing and expressing a
heterologous nucleic acid in a target cell comprising introducing a lentiviral vector of
the invention into the target cell and expressing the heterologous nucleic acid therein.
In various embodiments of the invention the heterologous nucleic acid is operably
linked to a constitutive, an inducible, or a cell type or tissue specific promoter,
allowing conditional expression of the nucleic acid.

[0012] In another aspect, the invention provides a method for achieving controlled
expression of a heterologous nucleic acid in a cell comprising steps of: (i) inserting
the heterologous nucleic acid into a lentiviral vector between sites for a recombinase,
thereby producing a modified lentiviral vector; (ii) introducing the modified lentiviral
vector or a portion thereof including at least the sites for the recombinase and the
region between the sites into the cell; and (iii) subsequently inducing expression of
the recombinase within the cell, thereby preventing expression of the heterologous
nucleic acid within the cell.
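
The logic of steps (i) through (iii) can be illustrated with a toy model in which a vector is an ordered list of named elements and recombinase activity removes whatever lies between two recombinase sites. The Python sketch below is a simplification under stated assumptions: the sites are taken to be directly repeated, excision is taken to leave a single site behind, and the element names and function names are hypothetical.

```python
from typing import List


def excise_floxed(elements: List[str], site: str = "loxP") -> List[str]:
    """Remove everything between the first two recombinase sites.

    One copy of the site is left behind; if fewer than two sites are
    present, the element list is returned unchanged.
    """
    positions = [i for i, name in enumerate(elements) if name == site]
    if len(positions) < 2:
        return list(elements)
    first, second = positions[0], positions[1]
    return elements[:first + 1] + elements[second + 1:]


def is_expressed(elements: List[str], cassette: str) -> bool:
    """In this toy model a cassette is expressed only if it is still present."""
    return cassette in elements


# Hypothetical modified vector: promoter and transgene sit between two sites.
vector = ["5' LTR", "psi", "loxP", "promoter", "transgene", "loxP",
          "WRE", "3' SIN LTR"]

after_recombinase = excise_floxed(vector)
print(is_expressed(vector, "transgene"))             # before induction
print(is_expressed(after_recombinase, "transgene"))  # after induction
```

In this model the transgene is present before the recombinase acts and absent afterwards, which corresponds to the loss of expression recited in step (iii).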

[0013] The invention also provides a method for expressing a transcript in a
mammal in a cell type or tissue-specific manner comprising: (i) delivering a lentiviral
vector to cells of the mammal, wherein the lentiviral vector comprises a heterologous
nucleic acid, and wherein the heterologous nucleic acid is located between sites for a
site-specific recombinase; and (ii) inducing expression of the site-specific
recombinase in a subset of the cells of the mammal, thereby preventing synthesis of
the transcript within those cells.

[0014] In another aspect, the invention provides a lentiviral vector whose
presence within a cell results in transcription of one or more ribonucleic acids (RNAs)
that self-hybridize or hybridize to each other to form a short hairpin RNA (shRNA) or
short interfering RNA (siRNA) that inhibits expression of at least one target transcript
in the cell. In certain embodiments of the invention the lentiviral vector comprises a
nucleic acid segment operably linked to a promoter, so that transcription from the
promoter (i.e., transcription directed by the promoter) results in synthesis of an RNA
comprising complementary regions that hybridize to form an shRNA targeted to the
target transcript. (When an RNA comprises complementary regions that hybridize
with each other, the RNA will be said to self-hybridize.) According to certain
embodiments of the invention the shRNA comprises a base-paired region
approximately 19 nucleotides long. According to certain embodiments of the
invention the RNA may comprise more than 2 complementary regions, so that
self-hybridization results in multiple base-paired regions, separated by loops or
single-stranded regions. The base-paired regions may have identical or different sequences
and thus may be targeted to the same or different regions of a single transcript or to
different transcripts.
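
As a rough illustration of a self-hybridizing transcript of this kind, the Python sketch below assembles a DNA template consisting of a 19-nucleotide sense stretch, a loop, the reverse complement of the sense stretch, and a terminal run of Ts (this application elsewhere refers to a pol III terminator as a stretch of Us in the RNA). The sense and loop sequences, the default terminator length, and the function names are arbitrary placeholders assumed for this example; they are not sequences disclosed herein.

```python
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def reverse_complement(dna: str) -> str:
    """Return the reverse complement of an upper-case DNA string."""
    return "".join(COMPLEMENT[base] for base in reversed(dna))


def hairpin_template(sense: str, loop: str, terminator_len: int = 5) -> str:
    """Build a DNA template: sense arm + loop + antisense arm + run of Ts."""
    sense = sense.upper()
    loop = loop.upper()
    if set(sense + loop) - set(COMPLEMENT):
        raise ValueError("sequences may contain only A, C, G, and T")
    return sense + loop + reverse_complement(sense) + "T" * terminator_len


def stem_pairs(template: str, stem_len: int, loop_len: int) -> bool:
    """Check that the two stem arms are perfectly complementary."""
    sense = template[:stem_len]
    antisense = template[stem_len + loop_len:2 * stem_len + loop_len]
    return antisense == reverse_complement(sense)


# Arbitrary 19-nt sense stretch and 9-nt loop, for illustration only.
SENSE = "GCTACAACTACTACATGAC"
LOOP = "TTCAAGAGA"

template = hairpin_template(SENSE, LOOP)
print(template)
print(stem_pairs(template, len(SENSE), len(LOOP)))
```

The check in stem_pairs confirms only that the two arms can base-pair along their full length; it says nothing about whether a given sequence would be effective in silencing a target transcript.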

[0015] In certain embodiments of the invention the lentiviral vector comprises a
nucleic acid segment flanked by two promoters in opposite orientation, wherein the
promoters are operably linked to the nucleic acid segment, so that transcription from
the promoters results in synthesis of two complementary RNAs that hybridize with
each other to form an siRNA targeted to the target transcript. According to certain
embodiments of the invention the siRNA comprises a base-paired region
approximately 19 nucleotides long. In certain embodiments of the invention the
lentiviral vector comprises at least two promoters and at least two nucleic acid
segments, wherein each promoter is operably linked to a nucleic acid segment, so that
transcription from the promoters results in synthesis of two complementary RNAs
that hybridize with each other to form an siRNA targeted to the target transcript. The
nucleic acid segment(s) present within the lentiviral vectors may be part of a larger
nucleic acid, e.g., a heterologous nucleic acid that is inserted into the vector as
described herein.

[0016] The lentiviral vectors of the invention may be lentiviral transfer plasmids
or infectious lentiviral particles (e.g., a lentivirus or pseudotyped lentivirus). As
discussed further below, lentiviruses have an RNA genome. Therefore, where the
lentiviral vector is a lentiviral particle, the viral genome must undergo reverse
transcription and second strand synthesis to produce DNA capable of directing RNA
transcription. In addition, where reference is made herein to elements such as cloning
sites, promoters, regulatory elements, etc., it is to be understood that the sequences of
these elements are present in RNA form in the lentiviral particles of the invention and
are present in DNA form in the lentiviral transfer plasmids of the invention.
Furthermore, where a template for synthesis of an RNA is "provided by" RNA
present in a lentiviral particle, it is understood that the RNA must undergo reverse
transcription and second strand synthesis to produce DNA that can serve as a template
for synthesis of RNA (transcription).

[0017] The invention further provides pharmaceutical compositions comprising
any of the inventive lentiviral vectors and a pharmaceutically acceptable carrier.

[0018] The invention further provides a three-plasmid lentiviral expression system
comprising (i) a lentiviral transfer plasmid, wherein the lentiviral transfer plasmid
directs transcription of at least one ribonucleic acid (RNA) that, when present within a
cell, hybridizes to form an shRNA or siRNA that inhibits expression of at least one
gene expressed in the cell; (ii) a packaging plasmid; and (iii) an Env-coding plasmid.
In certain embodiments of the invention the lentiviral transfer plasmid comprises a
nucleic acid segment operably linked to a promoter, so that transcription from the
promoter results in synthesis of an RNA that hybridizes to form an shRNA targeted to
a target transcript. In certain embodiments of the invention the lentiviral transfer
plasmid comprises a nucleic acid segment flanked by two oppositely directed
promoters, wherein the promoters are operably linked to the nucleic acid segment, so
that transcription from the promoters results in synthesis of two complementary
RNAs that hybridize with each other to form an siRNA targeted to a target transcript.
In certain embodiments of the invention the lentiviral transfer plasmid comprises two
promoters and two nucleic acid segments, wherein each promoter is operably linked
to a nucleic acid segment, so that transcription from the promoters results in synthesis
of two complementary RNAs that hybridize with each other to form an siRNA
targeted to a target transcript. The lentiviral transfer plasmid may, but need not be,
any of the inventive lentiviral transfer plasmids described herein.

[0019] The invention further provides a four plasmid lentiviral expression system
comprising a three plasmid lentiviral expression system as described above and a
fourth plasmid that encodes the Rev protein.

[0020] The invention additionally provides a method of inhibiting or reducing the
expression of a target transcript in a cell comprising delivering a lentiviral vector to
the cell, wherein presence of the lentiviral vector within the cell results in synthesis of
one or more RNAs that self-hybridize or hybridize with each other to form an shRNA
or siRNA that inhibits expression of the target transcript. Note that where presence of
the lentiviral vectors, particles, or plasmids of the invention results in production of an
shRNA, the shRNA may require further processing within the cell to form an
inhibitory structure. shRNAs that are so processed are considered to inhibit expression
of the target transcript.

[0021] The invention further provides a method for reversibly inhibiting or
reducing expression of a target transcript in a cell comprising: (i) delivering a
lentiviral vector to the cell, wherein presence of the lentiviral vector within the cell
results in synthesis of one or more RNAs that self-hybridize or hybridize with each
other to form an shRNA or siRNA that inhibits expression of the target transcript,
wherein the lentiviral vector comprises a nucleic acid segment located between sites
for a site-specific recombinase, which nucleic acid segment provides a template for
transcription of the one or more RNAs; and (ii) inducing expression of the
site-specific recombinase within the cell, thereby preventing synthesis of at least one of
the RNAs. The vector can be a lentiviral transfer plasmid or lentiviral particle.

[0022] The invention also provides a method for reversibly inhibiting or reducing
expression of a transcript in a mammal in a cell type or tissue-specific manner
comprising: (i) delivering to the mammal a lentiviral vector whose presence within a
cell results in synthesis of one or more RNAs that self-hybridize or hybridize with
each other to form an shRNA or siRNA that inhibits expression of the target
transcript, wherein the lentiviral vector comprises a nucleic acid segment located
between sites for a site-specific recombinase, which nucleic acid segment provides a
template for transcription of the RNA; and (ii) inducing expression of the site-specific
recombinase in a subset of the cells of the mammal, thereby preventing synthesis of at
least one of the RNAs within the subset of cells. In any of the above methods, the cell
may be a mammalian cell, the site-specific recombinase may be Cre, and the sites
may be loxP sites.

[0023] The invention includes a variety of therapeutic applications for the
inventive lentiviral vectors. In particular, the lentiviral vectors are useful for gene
therapy. The invention provides a method of treating or preventing infection by an
infectious agent, the method comprising the step of administering to a subject prior to,
simultaneously with, or after exposure of the subject to the infectious agent, a
composition comprising an effective amount of a lentiviral vector, wherein the
lentiviral vector directs transcription of at least one RNA that hybridizes to form an
shRNA or siRNA that is targeted to a transcript produced during infection by the
infectious agent, which transcript is characterized in that reduction in levels of the
transcript delays, prevents, or inhibits one or more aspects of infection by or
replication of the infectious agent.

[0024] In addition, the invention provides a method of treating or preventing a
disease or clinical condition, the method comprising: (i) removing a population of
cells from a subject at risk of or suffering from the disease or clinical condition; (ii)
engineering or manipulating the cells to contain an effective amount of an siRNA or
shRNA targeted to a transcript by infecting or transfecting the cells with a lentiviral
vector, wherein the transcript is characterized in that its degradation delays, prevents,
or inhibits one or more aspects of the disease or clinical condition; and (iii) returning
at least a portion of the cells to the subject. Suitable lentiviral vectors are described
herein. Without intending to suggest any limitation, the therapeutic approaches may
find particular use in diseases such as cancer, in which a mutation in a cellular gene is
responsible for or contributes to the pathogenesis of the disease, and in which specific
inhibition of the target transcript bearing the mutation may be achieved by expressing
an siRNA or shRNA targeted to the target transcript within the cells, without
interfering with expression of the normal allele. According to certain embodiments of
the invention, rather than removing cells from the body of a subject, infecting or
transfecting them in tissue culture and then returning them to the subject, inventive
lentiviral vectors or lentiviruses are delivered directly to the subject.

[0025] This application refers to various patents, journal articles, and other
publications, all of which are incorporated herein by reference. In addition, the
following publications are incorporated herein by reference: Current Protocols in
Molecular Biology, Current Protocols in Immunology, Current Protocols in Protein
Science, and Current Protocols in Cell Biology, John Wiley & Sons, N.Y., edition as
of July 2002; Sambrook, Russell, and Sambrook, Molecular Cloning: A Laboratory
Manual, 3rd ed., Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 2001.

[0026] Figure 1 shows a map of pBFGW.

[0027] Figure 2 shows a map of pLL3.0.

[0028] Figure 3 shows a map of pLL3.1.

[0029] Figure 4 shows a map of pLL3.2.

[0030] Figure 5 shows a map of pLL3.3.

[0031] Figure 6 shows a map of pLL3.4.

[0032] Figure 7 shows a map of pLL3.5.

[0033] Figure 8 shows a map of pLL3.6.

[0034] Figure 9 shows a map of pLL3.7.

[0035] Figure 10A shows schematic diagrams of the HIV provirus (upper panel)
and relevant portions of representative packaging and Env-coding plasmids (middle
and lower panels, respectively) for a three plasmid system.

[0036] Figure 10B shows schematic diagrams of the HIV provirus (upper panel)
and relevant portions of representative packaging, Rev-coding and Env-coding
plasmids (second, third, and lower panels, respectively) for a four plasmid system.

[0037] Figure 11 shows the siRNA structure found to be active in the Drosophila
system.

[0038] Figure 12 presents a schematic representation of the steps involved in
RNA interference in Drosophila.

[0039] Figure 13 shows a schematic diagram of a variety of exemplary shRNA
structures useful in accordance with the present invention.

[0040] Figure 14 presents a representation of an alternative inhibitory pathway, in
which the DICER enzyme cleaves a substrate having a base mismatch in the stem to
generate an inhibitory product that binds to the 3' UTR of a target transcript and
inhibits translation.

[0041] Figure 15 presents a schematic diagram of a nucleic acid that serves as a
template for transcription of an RNA that hybridizes to form an shRNA and also
shows the RNA before and after hybridization.

[0042] Figure 16 presents a schematic diagram of one example of a construct that
may be used to direct transcription of sense and antisense strands of an siRNA.

[0043] Figure 17A presents a schematic representation of a portion of the
lentivirus vector pLL3.7. Key: SIN-LTR: self-inactivating long terminal repeat; Ψ:
HIV packaging signal; cPPT: central polypurine tract; U6: U6 (RNA polymerase III)
promoter; MCS: multiple cloning site; CMV: cytomegalovirus (RNA polymerase II)
promoter; EGFP: enhanced green fluorescent protein; WRE: woodchuck hepatitis
virus response element.

[0044] Figure 17B presents the sequence of the CD8 stem loop used to generate
pLL3.7 CD8 (See Examples). A sequence known to silence CD8 as an siRNA (11)
was adapted with a loop sequence from Paddison et al. (39) to create the final
sequence. The presumed transcription initiation site is indicated by a +1. Nucleotides
which form the loop structure are indicated in green font (Loop). The pol III
terminator stretch (a stretch of Us in the RNA) is indicated in red font.

[0045] Figure 17C shows the predicted structure of the CD8 stem-loop RNA
produced from pLL3.7 CD8.

[0046] Figure 18A shows density plots demonstrating specific silencing of CD8
expression by pLL3.7 CD8. CD8+CD4+ E10 cells were either mock infected (No
Virus), infected with pLL3.7 (Control Virus), or infected with pLL3.7 CD8 (CD8 RNAi virus).
Density plots indicate the expression levels of CD4 and CD8 48 hours post-infection.

[0047] Figure 18B presents histograms showing staining for the T cell surface
markers CD3, TCRβ, and CD28. The histograms show that other surface markers
are unaffected by silencing of CD8. E10 cells infected with pLL3.7 (green
histograms) or pLL3.7 CD8 (pink histograms) were stained for CD3, TCRβ, and
CD28. Solid histograms represent the level of these surface markers on uninfected
cells.

[0048] Figure 19A shows stable silencing of CD8 by pLL3.7 CD8. Sorted
populations of infected E10 cells were maintained in long-term culture. E10 cells
infected with pLL3.7 CD8 (CD8 RNAi virus) were sorted four days after infection for GFP
expression and low CD8 expression, while cells infected with control virus were
sorted for GFP expression only. Each population was cultured for 1 month and
analyzed for CD8 expression via flow cytometry at weekly intervals. The CD8 and
GFP levels expressed by infected cells 4 days following infection and after one month
of culture are shown.

WO 2004/022722, PCT/US2003/028111

[0049] Figure 19B shows a Northern blot showing specific degradation of CD8
mRNA induced by pLL3.7 CD8. CD8 and CD4 mRNA levels in uninfected E10 cells,
or E10 cells infected with either pLL3.7 (Control Virus) or pLL3.7 CD8 (CD8 RNAi
Virus) and sorted on the basis of GFP and CD8 expression, were assayed. The bands
representing CD8 and CD4 mRNA species are identified by lines (top panel).

[0050] Figure 19C shows generation of processed shRNAs in cells infected with
pLL3.7 CD8. The cells analysed for CD8 and CD4 mRNA levels described in the
legend to Figure 19B were also assayed for the presence of shRNAs by Northern blot.
The locations of 21, 22, and 23 nucleotide RNAs are identified by arrows.

[0051] Figure 20A presents flow cytometric analysis showing specific silencing
of genes in primary T cells by pLL3.7 CD8 and pLL3.7 CD25. CD8+ TCR transgenic
T cells were activated for 3 days with cognate peptide and then infected with pLL3.7,
pLL3.7 CD8, or pLL3.7 CD25. The efficiency of infection was determined by
assaying GFP expression by flow cytometry. The expression of CD8 and CD25 on
infected T cells was assayed by staining with specific antibodies that bind these
surface markers.

[0052] Figure 20B is a bar graph showing functional silencing of genes in
primary T cells with pLL3.7 CD25. CD8+ TCR transgenic T cells were infected and
activated as in A and then cultured for 48 hours in the presence of increasing
concentrations of IL-2. Proliferation was assessed by 3H-thymidine incorporation.

[0053] Figure 21A shows flow cytometric analysis of expression of GFP from
pLL3.7 CD8 infection in the AK7 ES cell line. AK7 ES cells were infected with
pLL3.7 CD8 and sorted for GFP expression (green line) and compared with
uninfected control (purple peak).

[0054] Figure 21B shows fluorescent imaging of paws of ES cell-derived mice.
The paws of control and pLL3.7 CD8 ES chimeric mice were imaged with standard
epifluorescence for expression of EGFP.

[0055] Figure 21C shows flow cytometric identification of ES cell-derived
thymocytes in chimeric mice. Thymocytes from noninfected (purple peak) and
pLL3.7 CD8 (green line) ES derived mice were harvested and analyzed for GFP
expression.

[0056] Figure 21D is a photograph showing expression of CD4 and CD8 in the
thymus and spleen of ES cell-derived mice. Thymocytes and splenocytes from week
old control and CD8 RNAi (pLL3.7 CD8) ES cell-derived mice were harvested and
stained for CD4 and CD8 expression.

[0057] Figure 22A shows flow cytometric analysis of EGFP expression in cells
infected with an EGFP-expressing lentiviral vector in which the promoter and EGFP
coding sequences are floxed. The solid purple peaks represent uninfected cells. The
population of cells expressing EGFP is shown with a green line.

[0058] Figure 22B shows flow cytometric analysis of EGFP expression in cells
infected with an EGFP-expressing lentiviral vector 10 days after induction of Cre
expression. The solid purple peaks represent uninfected cells. The population of
cells expressing EGFP is shown with a green line.

[0059] Figure 22C shows a direct flow cytometric comparison between pLL3.7
infected D7 cells before (green line) and after (pink line) Cre delivery.

[0060] Figure 23 shows flow cytometric analysis of CD8 expression in T cells
transfected with transfer plasmids that direct expression of either an shRNA targeted
to CD8 or an irrelevant stem-loop sequence, demonstrating silencing of CD8 by the
CD8 shRNA. GFP expression is on the x-axis, and CD8 expression is on the y-axis.
The upper panel shows lack of GFP expression in untransfected cells. The middle
panel shows CD8 expression in GFP+ cells transfected with a transfer plasmid
targeted to an unrelated sequence. The lower panel shows reduced CD8 expression in
GFP+ cells transfected with a transfer plasmid targeted to CD8.

[0061] Figure 24 shows flow cytometric analysis of expression of transfected
human CD8 in wild type ES cells or ES cells infected with a mouse CD8 shRNA
virus, demonstrating that the mouse CD8 shRNA specifically silences mouse CD8
and not human CD8.

[0062] Figure 25 is a Northern blot showing higher expression levels of CD8
shRNA in cells that did (left) versus cells that did not (right) exhibit silencing of CD8
following infection with a mouse CD8 shRNA virus.

Definitions

[0063] The term defective as used herein refers to a nucleic acid that is not 
functional with regard to either (i) encoding its gene product or (ii) serving as a 
signaling sequence. For example, a defective env gene sequence does not encode a 
functional Env protein; a defective packaging signal will not facilitate the packaging
of a nucleic acid molecule that includes the defective signal. A nucleic acid may be
defective for some but not all of its functions. For example, a defective LTR may fail
to promote transcription of downstream sequences while still retaining the ability to
direct integration. Nucleic acid sequences may be made defective by any means
known in the art, including by mutagenesis, by the deletion of some or all of the
sequence, by inserting a heterologous sequence into the nucleic acid sequence, by
placing the sequence out-of-frame, or by otherwise blocking the sequence. Defective
sequences may also occur naturally, i.e., without human intervention, such as by
mutation, and may be isolated from viruses in which they arise. Proteins that are
encoded by a defective nucleic acid and are therefore not functional may be referred
to as defective proteins. It is to be understood that the term "defective" is relative. In
other words, the function need not be completely eliminated but is typically
substantially reduced relative to the comparable wild type function. Generally, a
defective sequence exhibits less than approximately 10% of the function of the
comparable wild type sequence, preferably less than approximately 5% of the
function of the comparable wild type sequence, yet more preferably less than
approximately 2%, less than approximately 1%, less than approximately 0.5%, or
approximately 0%, i.e., below the limits of detection.
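
The relative nature of the term can be illustrated with a short calculation. The following Python sketch is illustrative only and is not part of the disclosed methods; the function name `is_defective` and the idea of a single numeric activity measurement are assumptions made for the example. Only the approximately 10% default threshold comes from the definition above.

```python
def is_defective(activity: float, wild_type_activity: float,
                 threshold: float = 0.10) -> bool:
    """Return True if the measured activity is below `threshold`
    (a fraction, 0.10 = 10%) of the comparable wild type activity.

    Both activities are assumed to be measured in the same assay and units.
    """
    if wild_type_activity <= 0:
        raise ValueError("wild type activity must be positive")
    return activity / wild_type_activity < threshold
```

Stricter embodiments correspond to smaller thresholds, e.g. `threshold=0.05` or `threshold=0.01`.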

[0064] The terms deleted or deletion are used herein in accordance with their 
standard usage in the art, i.e., meaning either total removal of the specified segment or 
the removal of a sufficient portion of the specified segment to render the segment
inoperative or nonfunctional with respect to at least one of its functions.

[0065] The term heterologous, as used herein in reference to a nucleic acid, refers
broadly to a first nucleic acid that is inserted into a second nucleic acid such as a
plasmid or vector. In particular, the term refers to a nucleic acid that is not naturally
present in the wild type version of a virus-based vector or plasmid that is used to
deliver the sequence into a cell. The term also refers to a nucleic acid that is
introduced into a cell, tissue, organism, etc., by artificial means including, but not
limited to, transfection, transformation, or infection with a viral vector. Generally the
nucleic acid is either not naturally found in the cell, tissue, or organism or, if naturally
found therein, its expression is altered by introduction of the additional copy of the
nucleic acid (e.g., if the introduced copy is under the control of a different promoter
than the naturally occurring copy). The term is also used to refer to a protein encoded 
by such a nucleic acid sequence. If a heterologous sequence is introduced into a cell 
or organism, the sequence is considered heterologous to the progeny of such a cell or 
organism. 

[0066] The term hybridize, as used herein, refers to the interaction between two
complementary nucleic acid sequences. The phrase hybridizes under high stringency
conditions describes an interaction that is sufficiently stable that it is maintained under
art-recognized high stringency conditions. Guidance for performing hybridization
reactions can be found, for example, in Current Protocols in Molecular Biology, John
Wiley & Sons, N.Y., 6.3.1-6.3.6, 1989, and more recent updated editions, all of which
are incorporated by reference. See also Sambrook, Russell, and Sambrook, Molecular
Cloning: A Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, 2001. Aqueous and nonaqueous methods are described in that
reference and either can be used. Typically, for nucleic acid sequences over
approximately 50-100 nucleotides in length, various levels of stringency are defined,
such as low stringency (e.g., 6X sodium chloride/sodium citrate (SSC) at about 45°C,
followed by two washes in 0.2X SSC, 0.1% SDS at least at 50°C (the temperature of
the washes can be increased to 55°C for medium-low stringency conditions));
medium stringency (e.g., 6X SSC at about 45°C, followed by one or more washes in
0.2X SSC, 0.1% SDS at 60°C); high stringency (e.g., 6X SSC at about 45°C, followed
by one or more washes in 0.2X SSC, 0.1% SDS at 65°C); and very high stringency
(e.g., 0.5 M sodium phosphate, 0.1% SDS at 65°C, followed by one or more washes at
0.2X SSC, 1% SDS at 65°C). Hybridization under high stringency conditions only
occurs between sequences with a very high degree of complementarity. One of
ordinary skill in the art will recognize that the parameters for different degrees of
stringency will generally differ based on various factors such as the length of the
hybridizing sequences, whether they contain RNA or DNA, etc. For example,
appropriate temperatures for high, medium, or low stringency hybridization will
generally be lower for shorter sequences such as oligonucleotides than for longer
sequences.
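
For convenience, the example wash conditions recited above can be collected in a small lookup table. The following Python sketch is illustrative only; the names `Wash`, `STRINGENCY_WASHES`, and `wash_for` are hypothetical, and the table records only the example values stated in this paragraph for sequences over approximately 50-100 nucleotides.

```python
from typing import NamedTuple


class Wash(NamedTuple):
    ssc_x: float        # SSC concentration as a multiple of 1X
    sds_percent: float  # SDS concentration in percent
    temp_c: int         # wash temperature in degrees Celsius


# Example wash conditions from the text. Hybridization itself is in 6X SSC at
# about 45 C for low through high stringency, and in 0.5 M sodium phosphate,
# 0.1% SDS at 65 C for very high stringency.
STRINGENCY_WASHES = {
    "low": Wash(0.2, 0.1, 50),         # "at least at 50 C"
    "medium-low": Wash(0.2, 0.1, 55),
    "medium": Wash(0.2, 0.1, 60),
    "high": Wash(0.2, 0.1, 65),
    "very high": Wash(0.2, 1.0, 65),
}


def wash_for(level: str) -> Wash:
    """Look up the example wash conditions for a named stringency level."""
    return STRINGENCY_WASHES[level.lower()]
```

As the paragraph notes, these values are examples and would generally be adjusted for shorter sequences such as oligonucleotides.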

[0067] Infectious, as used herein in reference to a recombinant virus or viral
particle, indicates that the virus or viral particle is able to enter cells in a manner
substantially similar or identical to that of a wild type virus and to perform at least one
of the functions associated with infection by a wild type virus, e.g., release of the viral
genome in the host cell cytoplasm, entry of the viral genome into the nucleus, reverse
transcription and integration of the viral genome into the host cell's DNA. It is not
intended to indicate that the virus or viral particle is capable of undergoing replication
or of completing the viral life cycle. The terms "viral particle" and "virus" are 
frequently used interchangeably herein. For example, the phrase "production of 
virus" may refer to production of viral particles and is not intended to indicate that 
wild type or replication competent virus is produced. 

[0068] Isolated, as used herein, means 1) separated from at least some of the
components with which it is usually associated in nature; 2) prepared or purified by a
process that involves the hand of man; and/or 3) not occurring in nature. 
[0069] Operably linked, as used herein, refers to a relationship between two 
nucleic acid sequences wherein the expression of one of the nucleic acid sequences is
controlled by, regulated by, modulated by, etc., the other nucleic acid sequence. For
example, the transcription of a nucleic acid sequence is directed by an operably linked
promoter sequence; post-transcriptional processing of a nucleic acid is directed by an
operably linked processing sequence; the translation of a nucleic acid sequence is
directed by an operably linked translational regulatory sequence; the transport or
localization of a nucleic acid or polypeptide is directed by an operably linked
transport or localization sequence; and the post-translational processing of a
polypeptide is directed by an operably linked processing sequence. Preferably a
nucleic acid sequence that is operably linked to a second nucleic acid sequence is
covalently linked, either directly or indirectly, to such a sequence, although any 
effective three-dimensional association is acceptable. 

[0070] Purified, as used herein, means separated from many other compounds or
entities, e.g., compounds or entities with which it normally occurs in nature. A
compound or entity may be partially purified, substantially purified, or pure, where it
is pure when it is removed from substantially all other compounds or entities, i.e., is 
preferably at least about 90%, more preferably at least about 91%, 92%, 93%, 94%, 
95%, 96%, 97%, 98%, 99%, or greater than 99% pure. 

[0071] The term regulatory sequence is used herein to describe a region of nucleic
acid sequence that directs, enhances, or inhibits the expression (particularly
transcription, but in some cases other events such as splicing or other processing,
translation, etc.) of sequence(s) with which it is operatively linked. The term includes
promoters, enhancers and other transcriptional control elements. In some
embodiments of the invention, regulatory sequences may direct constitutive
expression of a nucleotide sequence; in other embodiments, regulatory sequences may
direct tissue-specific and/or inducible expression. For instance, non-limiting
examples of tissue-specific promoters appropriate for use in mammalian cells include
lymphoid-specific promoters (see, for example, Calame et al., Adv. Immunol. 43:235,
1988) such as promoters of T cell receptors (see, e.g., Winoto et al., EMBO J. 8:729,
1989) and immunoglobulins (see, for example, Banerji et al., Cell 33:729, 1983;
Queen et al., Cell 33:741, 1983), and neuron-specific promoters (e.g., the
neurofilament promoter; Byrne et al., Proc. Natl. Acad. Sci. USA 86:5473, 1989).
Developmentally-regulated promoters are also encompassed, including, for example,
the murine hox promoters (Kessel et al., Science 249:374, 1990) and the α-fetoprotein
promoter (Campes et al., Genes Dev. 3:537, 1989). In some embodiments of the
invention regulatory sequences may direct expression of a nucleotide sequence only
in cells that have been infected with an infectious agent. For example, the regulatory
sequence may comprise a promoter and/or enhancer such as a virus-specific promoter
or enhancer that is recognized by a viral protein, e.g., a viral polymerase, transcription
factor, etc.

[0072] A short, interfering RNA (siRNA) comprises an RNA duplex that is
approximately 19 basepairs long and optionally further comprises one or two
single-stranded overhangs or loops. An siRNA may be formed from two RNA strands that
hybridize together, or may alternatively be generated from a single RNA strand that
includes a self-hybridizing portion. When siRNAs include one or more free strand
ends, it is generally preferred that free 5' ends have phosphate groups, and free 3'
ends have hydroxyl groups. siRNAs include a portion that hybridizes with a target
transcript. In certain preferred embodiments of the invention, one strand of the
siRNA is precisely complementary with a region of the target transcript, meaning that
the siRNA hybridizes to the target transcript without a single mismatch. In other
embodiments of the invention one or more mismatches between the siRNA and the 
targeted portion of the target transcript may exist. In most embodiments of the 
invention in which perfect complementarity is not achieved, it is generally preferred 
that any mismatches be located at or near the siRNA termini. 

[0073] The term short hairpin RNA refers to an RNA molecule comprising at least
two complementary portions hybridized or capable of hybridizing to form a
double-stranded structure sufficiently long to mediate RNAi (typically at least 19 base pairs
in length), and at least one single-stranded portion, typically between approximately 1
and 10 nucleotides in length, that forms a loop. As described further below, shRNAs
are thought to be processed into siRNAs by the conserved cellular RNAi machinery.
Thus shRNAs are precursors of siRNAs and are similarly capable of inhibiting 
expression of a target transcript. 
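
The stem-loop layout described in the two preceding definitions can be sketched in a few lines of Python. This example is illustrative only and is not a design procedure taken from the disclosure: the function names are hypothetical, the stem and loop sequences a caller would pass in are arbitrary placeholders, and the length checks simply restate the typical ranges given above (a stem of at least 19 base pairs and a loop of approximately 1 to 10 nucleotides).

```python
# Watson-Crick pairing for RNA bases.
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def reverse_complement_rna(seq: str) -> str:
    """Return the reverse complement of an RNA sequence (A/U/G/C only)."""
    return "".join(COMPLEMENT[base] for base in reversed(seq.upper()))


def build_shrna(stem: str, loop: str) -> str:
    """Lay out a single-stranded hairpin as stem + loop + reverse complement
    of the stem, so the two stem portions can self-hybridize."""
    if len(stem) < 19:
        raise ValueError("stem is typically at least 19 nucleotides")
    if not 1 <= len(loop) <= 10:
        raise ValueError("loop is typically about 1 to 10 nucleotides")
    stem = stem.upper()
    return stem + loop.upper() + reverse_complement_rna(stem)
```

For a 19-nucleotide stem and a 4-nucleotide loop, the resulting strand is 42 nucleotides long, with its last 19 nucleotides complementary to its first 19.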

[0074] The phrase structural protein as used herein refers to the proteins which are
required for encapsidation (e.g., packaging) of a retroviral or lentiviral genome, and
include Gag, Pol and Env.

[0075] The term subject, as used herein, refers to any individual to whom a 
lentiviral vector of the invention is delivered for any purpose. Preferred subjects are 
mammals, particularly rodents (e.g., mice and rats), domesticated mammals (e.g., 
dogs, cats, etc.), primates, or humans. 

[0076] An siRNA or shRNA or an siRNA or shRNA sequence is considered to be
targeted to a target transcript for the purposes described herein if 1) the stability of the
target transcript is reduced in the presence of the siRNA or shRNA as compared with
its absence; and/or 2) the siRNA or shRNA shows at least about 90%, more
preferably at least about 91%, 92%, 93%, 94%, 95%, 96%, 97%, 98%, 99%, or 100%
precise sequence complementarity with the target transcript for a stretch of at least
about 17, more preferably at least about 18 or 19 to about 21-23 nucleotides; and/or 3)
one strand of the siRNA or one of the self-complementary portions of the shRNA
hybridizes to the target transcript under stringent conditions for hybridization of small
(<50 nucleotide) RNA molecules in vitro and/or under conditions typically found
within the cytoplasm or nucleus of mammalian cells. Since the effect of targeting a
transcript is to reduce or inhibit expression of the gene that directs synthesis of the
transcript, an siRNA or shRNA targeted to a transcript is also considered to target the
gene that directs synthesis of the transcript even though the gene itself (i.e., genomic 
DNA) is not thought to interact with the siRNA, shRNA, or components of the 
cellular silencing machinery. Thus as used herein, an siRNA or shRNA that targets a 
gene is understood to target a transcript whose synthesis is directed by the gene. 
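
Criterion 2) above is a sequence calculation and can be sketched as follows. This Python example is illustrative only and is not the procedure used in the disclosure. The function names are hypothetical, the comparison is a simple ungapped sliding window, and the defaults (a stretch of at least 17 nucleotides with at least 90% complementarity) restate the least stringent figures in the definition.

```python
COMPLEMENT = {"A": "U", "U": "A", "G": "C", "C": "G"}


def best_complementarity(strand: str, transcript: str) -> float:
    """Highest fraction of positions at which `strand` is complementary
    (antiparallel, ungapped) to any equally long window of `transcript`."""
    strand = strand.upper().replace("T", "U")
    transcript = transcript.upper().replace("T", "U")
    # The transcript site that would pair with the strand without mismatch.
    ideal_site = "".join(COMPLEMENT[b] for b in reversed(strand))
    n = len(ideal_site)
    best = 0.0
    for start in range(len(transcript) - n + 1):
        window = transcript[start:start + n]
        matches = sum(a == b for a, b in zip(ideal_site, window))
        best = max(best, matches / n)
    return best


def meets_criterion_2(strand: str, transcript: str,
                      min_len: int = 17, min_fraction: float = 0.90) -> bool:
    """True if the strand is at least `min_len` nt long and at least
    `min_fraction` complementary to some stretch of the transcript."""
    return (len(strand) >= min_len
            and best_complementarity(strand, transcript) >= min_fraction)
```

A 19-nucleotide strand with a single mismatch scores 18/19 (about 94.7%) and therefore still satisfies the 90% default, consistent with the statement that mismatches may be tolerated in some embodiments.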

[0077] The term vector is used herein to refer to a nucleic acid molecule capable
of mediating entry of, e.g., transferring, transporting, etc., another nucleic acid
molecule into a cell. The transferred nucleic acid is generally linked to, e.g., inserted
into, the vector nucleic acid molecule. A vector may include sequences that direct
autonomous replication, or may include sequences sufficient to allow integration into
host cell DNA. Useful vectors include, for example, plasmids, cosmids, and viral
vectors. Useful viral vectors include, e.g., replication defective retroviruses,
adenoviruses, adeno-associated viruses, and lentiviruses. As will be evident to one of
ordinary skill in the art, viral vectors may include various viral components in
addition to nucleic acid(s) that mediate entry of the transferred nucleic acid. Thus the
term viral vector may refer either to a virus or viral particle capable of transferring a
nucleic acid into a cell or to the transferred nucleic acid itself. In particular, the terms
"lentiviral vector" and "lentiviral expression vector" may be used to refer to lentiviral
transfer plasmids and/or lentiviral particles of the invention as described below. 

Detailed Description of Certain Preferred Embodiments of the Invention

[0078] Retroviruses and retroviral vectors

[0079] The retrovirus family consists of a group of viruses with a diploid RNA
genome that is reverse transcribed during the viral life cycle to yield a
double-stranded DNA intermediate that stably integrates into the chromosomal DNA of a
host cell. The integrated DNA intermediate is referred to as a provirus. As used
herein, a provirus is "derived from" a virus or viral particle that delivers the nucleic
acid from which the proviral DNA is reverse transcribed to the cytoplasm of the cell.
The retroviral genome and proviral DNA include three genes referred to as gag, pol,
and env, flanked by two long terminal repeat sequences (LTRs). The 5' and 3' LTRs
contain elements that promote transcription (promoter-enhancer elements) and
polyadenylation of viral RNA. The LTRs also include additional cis-acting sequences
required for viral replication. In addition, the viral genome includes a packaging
signal referred to as psi (Ψ) that is necessary for encapsidation (packaging) of the
retroviral genome. As used herein, a packaging signal or psi sequence is any
sequence sufficient to direct packaging of a nucleic acid whose sequence comprises
the packaging signal. This includes naturally occurring psi sequences and also
engineered variants thereof. 

[0080] Briefly, the normal infective cycle begins when the virus attaches to the 
surface of a susceptible cell through interaction with one or more cell surface 
receptors. The virus fuses with the cell membrane, and the viral core is delivered to 
the cytoplasm, where the viral matrix and capsid become dismantled, releasing the
viral genome. Viral reverse transcriptase copies the RNA genome into DNA, which 
moves into the nucleus, where its integration into host cell DNA is catalyzed by the 
viral integrase enzyme. 

[0081] Once integrated into a host genome, viral DNA can remain dormant for
long periods of time. When activated, the viral DNA is transcribed by host cell RNA
polymerase. The resulting transcript is both a genome for a new virion and a
transcript from which viral gag and gag-pol polyproteins are synthesized. These
polyproteins are later processed into the matrix (MA), capsid (CA), and nucleocapsid
(NC) proteins (in the case of gag), or the matrix, capsid, protease (PR), reverse
transcriptase (RT), and integrase (INT) proteins (in the case of gag-pol). The
full-length viral RNA transcript also yields transcripts that act as templates for synthesis
of other viral proteins including envelope glycoproteins and, in the case of
lentiviruses, a number of regulatory proteins via various splicing events. Newly made
Gag and Gag-Pol polyproteins associate with one another, with complete viral
genomes, and with envelope proteins in the cell membrane so that a new viral particle
begins to assemble at the membrane. As assembly continues, the structure extrudes
from the cell, thereby acquiring a lipid coat punctuated with envelope glycoproteins.
Further discussion of the retroviral life cycle and features and descriptions of
retrovirus classification and taxonomy may be found in Coffin, J., et al. (eds.),
Retroviruses, Cold Spring Harbor Laboratory Press, Cold Spring Harbor, 1997, and in
Fields, B., et al., Fields' Virology, 4th ed., Philadelphia: Lippincott Williams and
Wilkins; ISBN: 0781718325, 2001. See also the Web site having URL
www.ncbi.nlm.nih.gov/ICTVdb/ICTVdB, accessed October 11, 2002, providing a
classification and information about viruses, of which retroviruses are entry 61 and
lentiviruses are entry 61.0.6.

[0082] The ability of retroviruses to enter host cells and to mediate the integration
of heterologous nucleic acid sequences into the cellular genome (transduction) has led
to their widespread use for in vitro and in vivo transfer and expression of nucleic
acids, a process often referred to as gene transfer. However, the heterologous nucleic
acid need not be a gene and need not encode a protein. As used herein, the term
"gene transfer" refers to transfer of any nucleic acid. A transferred nucleic acid is
"expressed" in a cell if the introduction of the nucleic acid into the cell results, either
directly or indirectly (such as via reverse transcription, integration, transcription, and, 
in some cases, translation) in the presence of an expression product of the nucleic acid 
(e.g., an RNA transcript and/or a polypeptide) within the cell. 
[0083] Advantages of retroviral vector systems include: (i) efficient entry of
genetic material (the vector genome) into cells; (ii) an efficient process of entry into
the target cell nucleus; (iii) relatively high levels of gene expression in many settings;
(iv) minimal pathological effects on target cells in the case of many retroviruses; and
(v) the potential to target particular cellular subtypes through control of the
vector-target cell binding and tissue-specific control of gene expression (e.g., using
tissue-specific promoters and/or enhancers).

[0084] In using a retrovirus for gene transfer, a foreign (not part of the wild type
virus) sequence (e.g., a gene of interest) may be inserted into the retroviral genome in
place of wild type retroviral sequences. When the retrovirus delivers its genome to a
cell, the foreign sequence is also introduced into the cell and may then be integrated
into the host's cellular DNA as in the case of a wild type retroviral genome. The
sequence may then be transcribed by the host cell's transcriptional machinery. If the
sequence includes a coding region, translation of the sequence within the host results
in expression of the encoded protein by the host cell. The features described above 
have made retroviral vectors particularly attractive for gene therapy although they 
may be used in numerous other applications as described below. 
[0085] In order to improve their safety, many recombinant retroviruses designed
for gene transfer are replication defective, i.e., the genome does not encode functional
forms of all the proteins necessary for the complete infective cycle. For example,
sequences encoding the structural proteins may be mutated or deleted. In particular,
part or all of the sequence encoding the structural proteins may be replaced by a
different nucleic acid sequence, i.e., a nucleic acid sequence that is to be introduced
into a target cell. However, the packaging signal remains intact. The nucleic acid
sequence may include a promoter or its transcription may be under control of the viral
LTR promoter-enhancer. In order to produce infectious viral particles that can be
used to deliver the recombinant genome to cells, the required viral proteins are
provided in trans. This may be accomplished using a variety of approaches as further
described below.

[0086] Lentiviruses and lentiviral vectors 

[0087] Lentiviruses are a family of retroviruses that differ from the simple
retroviruses described above in that their genome includes any of a variety of genes in
addition to Gag, Pol, and Env and may also include various regulatory elements. The
additional genes typically encode regulatory proteins such as Vif, Vpr, Vpu,
Tat, Rev, and Nef. (For a discussion of various transcripts present at different times
during the life cycle of HIV, see, for example, Kim et al., J. Virol. 63:3708, 1989,
incorporated herein by reference). Further discussion of the lentiviral life cycle and
features and descriptions of lentivirus classification and taxonomy may be found in
Coffin, J., et al. (eds.), Retroviruses, Cold Spring Harbor Laboratory Press, Cold
Spring Harbor, 1997, and in Fields, B., et al., Fields' Virology, 4th ed., Philadelphia:
Lippincott Williams and Wilkins; ISBN: 0781718325, 2001.

[0088] The fact that retroviruses cannot effectively direct integration into the
genome of nondividing cells has limited their use for introducing genes into many
important targets such as liver, skeletal muscle, heart, brain, retina, and various cells
of the hematopoietic system. In contrast, lentiviruses are able to productively infect
and transduce nondividing cells, which has motivated the development of lentiviral
vectors for gene transfer (20, 21). For example, lentiviruses are able to infect resting T
cells, dendritic cells, and macrophages. Lentiviral vectors can also transfer genes to
hematopoietic stem cells with a superior gene transfer efficiency and without
affecting the repopulating capacity of these cells. Lentiviral vectors can also
transduce liver, skeletal muscle, retina, and neuronal cells. See, e.g., Mautino and
Morgan, AIDS Patient Care STDS 2002 Jan;16(1):11-26; Somia, N., et al., J. Virol.
74(9): 4420-4424, 2000; Miyoshi, H., et al., Science 283: 682-686, 1999; US patent
6,013,516, and references 21 and 24. In addition, lentiviruses display reduced
susceptibility to developmental silencing relative to simple retroviruses (24). This
feature enables their use for the creation of transgenic animals, which is impractical
with simple retroviruses because developmental silencing results in low or
undetectable levels of transgene expression.

[0089] As mentioned above, to enhance safety, recombinant retroviruses and
lentiviruses designed for gene transfer are typically replication defective, i.e., the
genome does not encode functional forms of all the proteins necessary for the
complete infective cycle. The necessary proteins are therefore provided in trans.
According to one approach, these proteins are provided by a packaging cell that has
been engineered to produce the proteins. Methods for preparing packaging cell lines
that express retrovirus proteins are well known in the art (see, e.g., U.S. Pat. No.
4,650,764 to Temin et al.; U.S. Patent No. 5,955,331 to Danos et al.; Sheridan et al.,
Molecular Therapy 2(3):262-275, Sep., 2000). Known packaging cell lines include
Ψ2, PA317, and PA12, among others.

[0090] In the absence of a nucleic acid sequence containing appropriate packaging
signals, the packaging cell produces empty virions. When a nucleic acid sequence
containing appropriate packaging signals is present within the packaging cell (as may
be achieved by either stably or transiently transfecting the cell with a construct
capable of directing transcription of such a sequence), the sequence can be packaged,
yielding infectious viral particles. The resulting cell is referred to as a producer cell.

[0091] In the context of certain embodiments of the present invention, a
packaging cell will comprise a host cell containing packaging-signal defective nucleic
acid sequence(s) coding for retroviral protein(s). The cell is thus able to produce
retroviral protein(s) but unable to produce replication-competent infectious virus.
Packaging cells may be created by transfecting a host cell (e.g., a human 293T cell)
with one or more nucleic acid sequences encoding such protein(s) according to known
procedures. Any suitable combination of expression cassettes capable of driving
synthesis of the required proteins is sufficient. Typically the packaging cell line
contains (i) a modified retroviral genome encoding functional Gag and Gag-Pol 
polyproteins but unable to produce functional envelope protein; and (ii) a sequence 
encoding an envelope protein. 

[0092] The various proteins need not all originate from the same viral species.
For example, the Gag and Pol proteins may be derived from any of a wide variety of
retroviruses or lentiviruses. According to certain preferred embodiments of the
invention the Gag and Pol proteins are derived from a lentivirus. According to certain
embodiments of the invention the Gag and Pol proteins are derived from HIV. Many
different types of host cell may be used, provided that the cells are permissive for
transcription from the promoters employed. Suitable host cells include, for example,
293 cells and derivatives thereof such as 293T, 293FT (Invitrogen), 293F, etc.,
NIH3T3 cells, etc. In general, any mammalian cell that supports transfection and can
be grown in sufficient quantities can be used. One of ordinary skill in the art will be 
able to select appropriate host cells. 

[0093] Although an envelope derived from the same retrovirus or lentivirus from
which the other viral proteins are derived can be used (homologous envelope), the use
of a nonhomologous envelope protein such as the VSV G glycoprotein significantly
reduces or eliminates the possibility of generating wild-type virus during vector
manufacturing or after introduction of the vectors into host cells. Thus one useful
class of lentiviral vectors consists of replication-defective, hybrid viral particles made
from the core proteins and enzymes of a lentivirus and the envelope of a different
virus such as the vesicular stomatitis virus (VSV) or the Moloney leukemia virus.

[0094] Safety considerations prompted development of alternative approaches to
the production of recombinant retroviral and lentiviral particles capable of infecting
and transducing cells. According to an approach described in U.S. Patent Number
6,013,516 and also in references 19 and 20, three different constructs may be used to
produce the recombinant lentiviral particles. Two of these constructs provide
packaging functions, one containing sequences encoding the core proteins and 
enzymes of the lentivirus and the other containing sequences encoding the envelope 
protein of a different related or unrelated virus. 

[0095] The third construct, referred to herein as a transfer construct, transfer
vector, or transfer plasmid, includes a cloning site for insertion of a heterologous
nucleic acid (i.e., a sequence not derived from the lentivirus) in addition to the
cis-acting viral sequences that are necessary for certain aspects of the viral life cycle such
as encapsidation, reverse transcription, and integration. 

[0096] The three plasmid system, which does not require helper virus, and use of
a heterologous envelope improve the safety of the vector by reducing the likelihood
that a replication-competent recombinant could be generated. In addition, removal of
various non-essential cis-acting sequences and the discovery that sequences encoding
certain viral proteins can be removed while still allowing efficient gene transfer
further contribute to the safety of this system. These advances are reviewed in
reference 21 and articles listed therein, all of which are incorporated herein by
reference. 

[0097] The present invention provides new lentiviral transfer plasmids, new
replication-defective lentiviruses, and new lentiviral expression systems. Maps of
exemplary lentiviral transfer constructs of the invention are provided in Figures 2
through 9 and corresponding sequences are provided as SEQ ID NOS: 2 through 9.
However, the invention is not limited to these specific embodiments. Figure 2 shows
a map of one of the transfer plasmids of the invention in which nucleotide 0 is
indicated. For purposes of description, nucleotides are numbered in a clockwise
direction with reference to nucleotide 0, and elements having lower nucleotide
numbers are considered 5' to elements having higher nucleotide numbers. Thus, for
example, the CMV element is 5' to all other elements shown. Note that various
elements depicted in the maps are not shown to scale. Also, the presence of a
particular element on a map is not intended to indicate that the entire element is
necessarily present. For example, according to certain embodiments of the invention 
a portion of the 5' LTR is deleted. 

[0098] According to certain embodiments of the invention the lentiviral transfer
plasmids are HIV-based lentiviral transfer plasmids. As used herein, a lentiviral
plasmid is said to be "based on" a particular lentivirus species (e.g., HIV-1) or group
(e.g., primate lentivirus group) if at least 50% of the lentiviral sequences found in the
plasmid are derived from a lentivirus of that particular species or group or, alternately,
if the transfer plasmid displays greater identity or homology to a lentivirus of that
particular species or group than to other known lentiviruses. Thus an HIV-based
lentiviral transfer plasmid is a transfer plasmid in which at least 50% of the lentiviral
sequences are derived from (i.e., originate from) either HIV-1 or HIV-2 or,
alternately, one that displays greater identity or homology to HIV-1 or
HIV-2 than to other known lentiviruses. In cases where the origin of any given
sequence is unknown, the likelihood that it is derived from a particular lentivirus may
be determined by sequence comparison using, e.g., programs such as BLAST,
BLASTNR, or CLUSTALW (or variations thereof). Searches in a comprehensive
database such as GenBank, Unigene, etc., can be performed using, e.g., default
parameters and matrices (e.g., BLOSUM substitution matrix). (BLAST is described in
Altschul, SF, et al., Basic local alignment search tool, J. Mol. Biol., 215(3): 403-410,
1990, Altschul, SF and Gish, W, Methods in Enzymology.
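
The first prong of the "based on" definition is a simple proportion. The following Python sketch is illustrative only; the function name `is_based_on`, the representation of a plasmid as a list of (length, origin) segments, and the example lengths are all assumptions made for the example. Only the 50% threshold comes from the definition above, and the alternative identity or homology prong is not modeled.

```python
from typing import Iterable, Set, Tuple


def is_based_on(segments: Iterable[Tuple[int, str]], group: Set[str],
                threshold: float = 0.5) -> bool:
    """Return True if at least `threshold` of the lentiviral sequence, by
    length, is derived from a lentivirus in `group`.

    `segments` lists (length_in_nucleotides, origin) for the lentiviral
    portions of the plasmid only; non-lentiviral sequence is excluded.
    """
    segments = list(segments)
    total = sum(length for length, _ in segments)
    if total == 0:
        return False
    derived = sum(length for length, origin in segments if origin in group)
    return derived / total >= threshold
```

For example, a plasmid with hypothetical lentiviral segments of 600 nt from HIV-1, 100 nt from HIV-2, and 300 nt from another lentivirus would be HIV-based under this prong, since 700 of 1000 nt (70%) originate from HIV-1 or HIV-2.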

[0099] The invention provides a lentiviral transfer plasmid whose sequence 
comprises a nucleic acid sequence including (i) a functional packaging signal; (ii) a 
multiple cloning site (MCS); and (iii) at least one additional element selected from the
group consisting of: a second MCS, a second MCS into which a heterologous
promoter or promoter-enhancer is inserted, an HIV FLAP element, an
expression-enhancing posttranscriptional regulatory element, a target site for a
site-specific recombinase, and a self-inactivating (SIN) LTR. It is to be understood that the target
site for a site-specific recombinase is in addition to any site(s) required for integration
of the lentiviral genome. In other words, in those embodiments of the invention in
which the additional element is a target site for a site-specific recombinase, the
lentiviral transfer plasmid will typically also include target sites for the corresponding
lentiviral integrase (which normally exist within the LTRs). In particular, the
invention provides (1) a lentiviral transfer construct as described immediately above 
wherein the additional element is a second MCS; (2) a lentiviral transfer construct as 
described immediately above wherein the additional element is a second MCS in 
which a heterologous promoter or promoter-enhancer is inserted; (3) a lentiviral
transfer construct as described immediately above wherein the additional element is 
an HIV FLAP element; (4) a lentiviral transfer construct as described immediately 
above wherein the additional element is an expression-enhancing posttranscriptional 
regulatory element such as the woodchuck hepatitis virus regulatory element (WRE);
(5) a lentiviral transfer construct as described immediately above wherein the
additional element is a recombination site for a site-specific recombinase; and (6) a
lentiviral transfer construct as described immediately above wherein the additional 
element is a SIN LTR. The lentiviral transfer plasmid may also comprise one or 
more heterologous promoters, enhancers, or promoter-enhancers. 

[00100] The invention further provides lentiviral transfer plasmids containing at
least two, at least three, at least four, at least five, or all of these additional elements.
In particular, the invention provides a lentiviral transfer plasmid comprising a nucleic
acid sequence that includes (i) a functional packaging signal; (ii) a multiple cloning
site (MCS); (iii) a second MCS; (iv) a second MCS in which a heterologous promoter
or a heterologous promoter-enhancer is inserted; (v) an HIV FLAP element; (vi) a
WRE; (vii) two loxP sites; and (viii) a self-inactivating (SIN) LTR. The invention also
encompasses lentiviral transfer plasmids as described above in which a heterologous 
nucleic acid is inserted at a site within an MCS. It will be appreciated that insertion of 
such a sequence separates the MCS into two parts. 

[00101] According to preferred embodiments of the invention the transfer plasmid
includes the cis-acting sequence elements required to support reverse transcription of
a lentiviral genome and also the cis-acting sequence elements necessary for the
packaging and integration of a lentiviral genome. These sequences typically include
the Psi (Ψ) packaging sequence, reverse transcription signals, integration signals,
promoter or promoter/enhancer, polyadenylation sequence, tRNA binding site, and
origin for second strand DNA synthesis. According to certain embodiments of the
invention the transfer plasmid contains a Rev Response Element (RRE) such as that
located at positions 7622-8459 in the HIV NL4-3 genome (Genbank accession
number AF003887). Of course RREs from other strains of HIV can also be used.
Such sequences are readily available from Genbank or from the database having URL
hiv-web.lanl.gov/content/index. According to certain embodiments of the invention
the transfer plasmid contains a 5' HIV R-U5-del gag element such as that located at
positions 454-1126 in the HIV NL4-3 genome. According to preferred embodiments
of the invention the transfer plasmid contains a sequence encoding a selectable marker
such as the ampicillin resistance gene (AmpR) and an origin of replication that allows
the plasmid to replicate within bacterial cells, such as the pUC origin. Various
features and elements mentioned above (and others) are more fully described in the
following sections.
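
As a rough illustration of how the coordinate ranges quoted above translate into element lengths, the following sketch assumes the positions are 1-based and inclusive, as in GenBank feature tables. That convention is an assumption here; the text does not state it. The helper names are hypothetical.

```python
def element_length(start, end):
    """Length of a feature given 1-based, inclusive coordinates."""
    if end < start:
        raise ValueError("end must not precede start")
    return end - start + 1

def extract_element(genome, start, end):
    """Slice a 1-based, inclusive interval out of a Python string."""
    return genome[start - 1:end]

# Intervals quoted in the paragraph above (HIV NL4-3 coordinates).
rre_length = element_length(7622, 8459)       # 838 nt under this convention
r_u5_gag_length = element_length(454, 1126)   # 673 nt under this convention
```

The same `extract_element` helper could be applied to a downloaded genome string to pull out either element, again assuming the coordinate convention holds for the record being used.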

[00102] Lentiviral genome sequences. The lentiviral transfer plasmids may include 
lentiviral sequences derived from any of a wide variety of lentiviruses including, but 
not limited to, primate lentivirus group viruses such as human immunodeficiency
viruses HIV-1 and HIV-2 or simian immunodeficiency virus (SIV); feline lentivirus
group viruses such as feline immunodeficiency virus (FIV); ovine/caprine
immunodeficiency group viruses such as caprine arthritis encephalitis virus (CAEV);
bovine immunodeficiency-like virus (BIV); equine lentivirus group viruses such as
equine infectious anemia virus; and visna/maedi virus. It will be appreciated that each
of these viruses exists in multiple variants or strains.

[00103] According to certain preferred embodiments of the invention most or all of 
the lentiviral sequences are derived from HIV-1. For example, according to certain 
embodiments of the invention the lentiviral backbone of the transfer plasmids is 
derived from an HIV-1-based transfer plasmid such as that described in reference 29
or derivatives thereof such as those described in reference 24. However, it is to be
understood that many different sources of lentiviral sequences can be used, and 
numerous substitutions and alterations in certain of the lentiviral sequences may be 
accommodated without impairing the ability of the transfer plasmid to perform the 
functions described herein, and such variations are within the scope of the invention.
The ability of any particular lentiviral transfer plasmid to transfer nucleic acids and/or
to generate a lentiviral particle capable of infecting and transducing cells in the
presence of the required viral proteins may readily be tested by methods known in the 
art, some of which will be evident from the Examples. 

[00104] Unique restriction sites and multiple cloning sites. The invention provides 
new lentiviral transfer plasmids incorporating sites for a variety of different restriction 
enzymes. In particular, the invention provides lentiviral transfer constructs including
one or more multiple cloning sites (MCS), e.g., one MCS or two MCSs. As is well 
known in the art, a multiple cloning site, also referred to as a polylinker, or 
polycloning site, is a cluster of cloning sites such that many restriction enzymes 
operate within the site. A cloning site as used herein is a known sequence,
preferably a unique sequence on the plasmid (i.e., the only one on the plasmid), upon
which a restriction enzyme operates to linearize or cut the plasmid. Restriction sites for
numerous restriction enzymes are known in the art and are listed, for example, in the
catalogs of various manufacturers such as New England Biolabs, Promega, Boehringer
Ingelheim, etc. For purposes of the present invention a restriction site is unique if it is
recognized as such in the art or, alternately, if the enzyme displays at least a 5-fold
greater likelihood of cutting at the unique site than at any other site in the plasmid 
under standard digestion conditions. 
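
A minimal sketch of how sequence-level uniqueness of a restriction site might be checked on a circular plasmid follows. The recognition sequences listed are the standard palindromic ones for these enzymes, so scanning one strand suffices; the toy plasmid is invented for illustration. The kinetic criterion above (5-fold greater likelihood of cutting) is not modeled.

```python
# Standard recognition sequences (all palindromic, so one strand is enough).
RECOGNITION_SITES = {
    "NotI": "GCGGCCGC",
    "XhoI": "CTCGAG",
    "XbaI": "TCTAGA",
    "BamHI": "GGATCC",
    "HpaI": "GTTAAC",
    "PacI": "TTAATTAA",
}

def count_sites(plasmid, site):
    """Count occurrences of `site` in a circular plasmid, including overlaps."""
    plasmid = plasmid.upper()
    site = site.upper()
    # Append the first len(site)-1 bases so matches spanning the origin are found.
    circular = plasmid + plasmid[:len(site) - 1]
    return sum(1 for i in range(len(plasmid)) if circular.startswith(site, i))

def unique_enzymes(plasmid, sites=RECOGNITION_SITES):
    """Names of enzymes whose recognition sequence occurs exactly once."""
    return sorted(name for name, site in sites.items()
                  if count_sites(plasmid, site) == 1)

# Invented toy sequence: one BamHI site, two XhoI sites, and one NotI site
# that spans the arbitrary start/end of the circular sequence.
toy_plasmid = "GCCGC" + "AAAGGATCCAAACTCGAGAAACTCGAGAAA" + "GCG"
unique_in_toy = unique_enzymes(toy_plasmid)
```

In this toy case XhoI is excluded because it occurs twice, while the NotI site is still found even though it wraps around the sequence origin.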

[00105] Typically an MCS is less than approximately 100 nucleotides in length 
(measured from the most 5' nucleotide in the most 5' restriction site to the most 3'
nucleotide in the most 3' restriction site, and including both of these nucleotides) and
contains at least 4 unique restriction sites. According to certain embodiments of the 
invention an MCS is less than approximately 100 nucleotides in length. According to 
certain embodiments of the invention an MCS is less than approximately 75 
nucleotides in length. According to certain embodiments of the invention an MCS is
less than approximately 50 nucleotides in length. According to certain embodiments
of the invention the transfer plasmid comprises an MCS containing at least 5 unique 
restriction sites. According to other embodiments of the invention the transfer 
plasmid comprises an MCS containing at least 6 unique restriction sites. According to 
yet other embodiments of the invention the transfer plasmid comprises an MCS
containing at least 7 unique restriction sites. According to yet other embodiments of
the invention the transfer plasmid comprises an MCS containing at least 8 unique 
restriction sites. According to yet other embodiments of the invention the transfer
plasmid comprises an MCS containing at least 9 unique restriction sites. According to
yet other embodiments of the invention the transfer plasmid comprises at least two 
MCSs, each of which contains at least 7 unique restriction sites. The invention 
provides a lentiviral transfer plasmid containing an MCS that includes a site for a
restriction enzyme that leaves a blunt end after cutting. The invention further
provides a lentiviral transfer plasmid containing an MCS that includes a restriction
site that has an 8 bp recognition sequence. 
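
The length measurement described above (from the first base of the most 5' site through the last base of the most 3' site, inclusive) can be sketched as follows. The input format and the example positions are hypothetical, and "approximately 100" is simplified here to a strict cutoff of 100 nucleotides.

```python
def mcs_span(site_hits):
    """Inclusive length from the 5'-most base of the 5'-most site to the
    3'-most base of the 3'-most site.

    `site_hits` maps enzyme name -> (0-based start index, recognition length);
    this input format is an assumption made for the sketch.
    """
    starts = [start for start, _ in site_hits.values()]
    ends = [start + length for start, length in site_hits.values()]  # exclusive
    return max(ends) - min(starts)

def meets_typical_mcs_definition(site_hits, max_len=100, min_sites=4):
    """Check the 'typical' MCS description: at least 4 sites in under ~100 nt."""
    return len(site_hits) >= min_sites and mcs_span(site_hits) < max_len

# Invented positions for four unique sites.
example_hits = {"NotI": (10, 8), "XhoI": (20, 6), "XbaI": (28, 6), "BamHI": (36, 6)}
example_span = mcs_span(example_hits)  # bases 10..41 inclusive
```

With these invented positions the span is 32 nucleotides, so the example would satisfy the typical definition as well as the stricter 75- and 50-nucleotide embodiments.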

[00106] The invention provides a lentiviral transfer plasmid having unique
restriction sites for at least 4 enzymes selected from the group consisting of NotI,
ApaI, XhoI, XbaI, HpaI, NheI, PacI, NsiI, SphI, Sma/Xma, AccI, BamHI, and SphI.
The invention further provides a lentiviral transfer plasmid having unique restriction
sites for at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at
least 12, or at least 13 enzymes selected from the group consisting of NotI, ApaI,
XhoI, XbaI, HpaI, NheI, PacI, NsiI, SphI, Sma/Xma, AccI, BamHI, and SphI. The
invention further provides collections of two or more of any of the lentiviral transfer
plasmids described above. According to certain embodiments of the invention any of
the lentiviral transfer plasmids described above are HIV-based transfer plasmids.

[00107] HIV FLAP element. According to certain embodiments of the invention
the transfer plasmid includes an HIV FLAP element. This sequence contains
structural elements associated with the process of reverse transcription and
encompasses the central polypurine tract and central termination sequences (cPPT
and CTS). As described in Zennou, et al., Cell, 101, 173, (2000), during HIV-1
reverse transcription, central initiation of the plus-strand DNA at the central 
polypurine tract (cPPT) and central termination at the central termination sequence
(CTS) lead to the formation of a three-stranded DNA structure: the HIV-1 central
DNA flap. While not wishing to be bound by any theory, the DNA flap may act as a 
cis-active determinant of lentiviral genome nuclear import and/or may increase the 
titer of the virus. 

[00108] Expression-stimulating posttranscriptional regulatory element. The
invention provides lentiviral transfer plasmids comprising any of a variety of
posttranscriptional regulatory elements characterized in that their presence within a
transcript increases expression of the heterologous nucleic acid at the protein level.
According to certain embodiments of the invention the posttranscriptional regulatory
element is the woodchuck hepatitis virus regulatory element (WRE) as described in
Zufferey, R., J. Virol, 73, 2886, 1999. Other posttranscriptional processing
elements that may be used include the posttranscriptional processing element present
within the genome of various viruses such as that present within the thymidine kinase
gene of herpes simplex virus (Liu, X., and J. E. Mertz. Genes Dev. 9:1766-1780,
1995), and the posttranscriptional regulatory element (PRE) present in hepatitis B
virus (HBV) (Huang, Z. M., and T. S. Yen, Mol Cell Biol 15:3864-3869, 1995).
According to the invention the posttranscriptional regulatory element is positioned so
that a heterologous nucleic acid inserted into the transfer plasmid in the 5' direction
from the element will result in production of a transcript that includes the
posttranscriptional regulatory element at the 3' end. Figure 2 shows an example of a
transfer plasmid incorporating the WRE downstream of sites for insertion of one or
more heterologous nucleic acid sequences. Figure 6 shows an example of a transfer
plasmid in which a heterologous nucleic acid encoding EGFP has been inserted in the
5' direction from the WRE and the ubiquitin C (UbC) promoter has been inserted
upstream of the sequence encoding EGFP. This configuration results in synthesis of a
transcript whose 5' portion comprises EGFP coding sequences and whose 3' portion
comprises the WRE sequence.

[00109] Long terminal repeats. According to certain embodiments of the invention
the transfer plasmid includes a self-inactivating (SIN) LTR (29). As is known in the
art, during the retroviral life cycle, the U3 region of the 3' LTR is duplicated to form
the corresponding region of the 5' LTR in the course of reverse transcription and viral
DNA synthesis. Creation of a SIN LTR is achieved by inactivating the U3 region of
the 3' LTR (preferably by deletion of a portion thereof as described in reference 29).
The alteration is transferred to the 5' LTR after reverse transcription, thus eliminating
the transcriptional unit of the LTRs in the provirus, which should prevent
mobilization by replication competent virus. An additional safety enhancement is
provided by replacing the U3 region of the 5' LTR with a heterologous promoter to
drive transcription of the viral genome during production of viral particles.
Appropriate promoters include, e.g., the CMV promoter. Preferred promoters are able
to drive high levels of transcription in a Tat-independent manner. This replacement
further reduces the possibility of recombination to generate replication-competent
virus because there is no complete U3 sequence in the virus production system. Thus
in certain embodiments of the invention the transfer plasmid includes a
self-inactivating (SIN) 3' LTR. In certain embodiments of the invention the transfer
plasmid includes a 5' LTR in which the U3 region is replaced with a heterologous
promoter. The heterologous promoter drives transcription during transient
transfection, but after reverse transcription it is replaced by a copy of U3 from the 3'
LTR, which in the case of a SIN LTR contains a deletion that makes it unable to drive
transcription. Thus all transcription is driven by the internal promoter after
integration.
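
The U3 copying step described above can be pictured with a deliberately simplified model. The class and field names are hypothetical, and the model captures only the single fact that the provirus's 5' LTR inherits its U3 region from the 3' LTR of the transfer plasmid; it is not a model of reverse transcription itself.

```python
from dataclasses import dataclass, replace

@dataclass(frozen=True)
class LTR:
    # "wild-type", "deleted" (SIN), or the name of a heterologous promoter
    u3: str

    def drives_transcription(self):
        return self.u3 != "deleted"

@dataclass(frozen=True)
class Vector:
    ltr5: LTR
    ltr3: LTR
    internal_promoter: str

def reverse_transcribe(plasmid_form):
    """Model the provirus: the 5' LTR receives a copy of the 3' LTR's U3 region."""
    return replace(plasmid_form, ltr5=LTR(u3=plasmid_form.ltr3.u3))

# Plasmid form: heterologous promoter (e.g., CMV) in the 5' LTR, SIN deletion in
# the 3' LTR, and an internal promoter (UbC chosen only as an example).
transfer_plasmid = Vector(ltr5=LTR("CMV"), ltr3=LTR("deleted"), internal_promoter="UbC")
provirus = reverse_transcribe(transfer_plasmid)
```

In this toy model the 5' LTR is transcriptionally active in the plasmid form but inactive in the provirus, leaving the internal promoter as the only source of transcription after integration.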

[00110] According to certain embodiments of the invention one or both LTRs 
contain sequences that can be used to introduce insulator sequences into the vectors. 
In general, insulators are elements that can help to preserve the independent function 
of genes or transcription units embedded in a genome or genetic context in which
their expression may otherwise be influenced by regulatory signals within the genome
or genetic context. See, for example, Burgess-Beusse B, et al., Proc. Natl Acad. Sci.
published August 1, 2002, 10.1073/pnas.162342499 and Zhan HC, et al, Hum Genet,
Nov;109(5):471-8, 2001. In the context of the present invention, insulators "protect" 
the lentivirus-expressed sequences from integration site effects, which are mediated
by cis-acting elements present in genomic DNA, and lead to deregulated expression
of transferred sequences. The invention provides transfer plasmids in which an 
insulator sequence is inserted into one or both LTRs. 

[00111] Heterologous promoters and promoter/enhancers. Any of a wide variety
of heterologous promoter and promoter/enhancer elements may be included in the
transfer plasmids and used to direct transcription of a heterologous nucleic acid
sequence in cells infected with the recombinant lentiviral particles of the invention or
cells into which the transfer plasmids of the invention have been introduced, e.g., by 
transfection. According to certain embodiments of the invention the transfer plasmids 
and lentiviral particles include a single heterologous promoter. In other embodiments
two or more heterologous promoters are included. The promoters may be in the same
or in opposite orientation.

[00112] One of ordinary skill in the art will readily be able to select appropriate
promoters depending upon the particular application. For example, sometimes it will
be desirable to achieve constitutive, non-tissue specific, high level expression of a
heterologous nucleic acid sequence. For such purposes viral promoters or
promoter/enhancers such as the SV40 promoter, CMV promoter or
promoter/enhancer, etc., may be employed. Mammalian promoters such as the
beta-actin promoter, ubiquitin C promoter, elongation factor 1α promoter, tubulin
promoter, etc., may also be used. If the plasmids are to be used in non-mammalian
cells, appropriate promoters for such cells should be selected.

[00113] It may be desirable to achieve cell type specific or tissue-specific
expression of a heterologous nucleic acid sequence (e.g., to express a particular
heterologous nucleic acid in only a subset of cell types or tissues or during specific
stages of development), for which purpose tissue-specific promoters may be used. For
example, it may be desirable to achieve conditional expression in the case of
transgenic animals or for therapeutic applications, including gene therapy. As used
herein, the term "tissue specific promoter" refers to a regulatory element (e.g.,
promoter, promoter/enhancer or portion thereof) that preferentially directs
transcription in only a subset of cell or tissue types, or during discrete stages in the
development of a cell, tissue, or organism. A tissue specific promoter may direct
transcription in only a single cell type or in multiple cell types (e.g., two to several
different cell types). Numerous tissue-specific promoters are known, and one of
ordinary skill in the art will readily be able to identify tissue specific promoters (or to
determine whether any particular promoter is a tissue specific promoter) from the
literature or by performing experiments (such as Northern blots, immunoblots, etc.)
in which expression of either an endogenous gene or a reporter gene operably linked
to the promoter is compared in different cell or tissue types. For example, the nestin,
neural specific enolase, NeuN, and GFAP promoters direct transcription in various
neural or glial lineage cells; the keratin 5 promoter directs transcription in
keratinocytes; the MyoD promoter directs transcription in skeletal muscle cells; the
insulin promoter directs transcription in pancreatic beta cells; the CYP450 3A4
promoter directs transcription in hepatocytes.
The invention therefore provides lentiviral transfer plasmids as described above
comprising a tissue-specific promoter and methods of using the transfer plasmids and
lentiviral particles derived therefrom to achieve cell type or tissue specific expression.
Preferred promoters are active in mammalian cells. According to certain
embodiments of the invention the tissue-specific promoter is specific for brain (e.g.,
neurons), liver (e.g., hepatocytes), pancreas, skeletal muscle (e.g., myocytes), immune
system cells (e.g., T cells, B cells, macrophages), heart (e.g., cardiac myocytes),
retina, skin (e.g., keratinocytes), bone (e.g., osteoblasts or osteoclasts), etc.

[00114] It may be desirable to achieve conditional expression of a heterologous
nucleic acid sequence (e.g., to control expression of a particular heterologous nucleic
acid by subjecting a cell, tissue, organism, etc., to a treatment or condition that causes
the heterologous nucleic acid to be expressed or that causes an increase or decrease in
expression of the heterologous nucleic acid), for which purpose a variety of inducible
promoters and systems may be used. In particular, it may be desirable to achieve conditional
expression in the case of transgenic animals or for therapeutic applications, including
gene therapy. See, e.g., Haviv YS and Curiel DT, Adv Drug Deliv Rev, 53(2):135-54,
2001, describing approaches for achieving conditional gene expression in cancer cells.
As used herein, "conditional expression" may refer to any type of conditional
expression including, but not limited to: inducible expression; repressible expression;
expression in cells or tissues having a particular physiological, biological, or disease
state, etc. This definition is not intended to exclude cell type or tissue-specific
expression, since the type of cell or tissue may also be considered a condition.

[00115] One approach to achieving conditional expression involves the use of 
inducible promoters. As used herein, the term "inducible promoter" refers to a 
regulatory element (e.g., a promoter, promoter/enhancer or portion thereof) whose
transcriptional activity may be regulated by exposing a cell or tissue containing a
nucleic acid sequence operably linked to the promoter to a treatment or condition that
alters the transcriptional activity of the promoter, resulting in increased transcription 
of the nucleic acid sequence. For convenience, as used herein, the term "inducible 
promoter" also includes repressible promoters, i.e., promoters whose transcriptional 
activity may be regulated by exposing a cell or tissue containing a nucleic acid
sequence operably linked to the promoter to a treatment or condition that alters the
transcriptional activity of the promoter, resulting in decreased transcription of the 
nucleic acid sequence. Preferred inducible promoters are active in mammalian cells.
Inducible promoters include, but are not limited to, steroid-inducible promoters such
as the promoters for the genes encoding the glucocorticoid or estrogen receptors
(inducible by treatment with the corresponding hormone), metallothionein promoter
(inducible by treatment with various heavy metals), MX-1 promoter (inducible by
interferon), etc. The invention therefore provides lentiviral transfer plasmids as
described above comprising a tissue-specific promoter and methods of using the 
transfer plasmids and lentiviral particles derived therefrom to achieve cell type or 
tissue specific expression. 

[00116] Another approach to achieving conditional expression involves use of
binary transgenic systems, in which gene expression is controlled by the interaction of
two components: a "target" transgene and an "effector" transgene, whose product acts
on the target transgene. See, e.g., Lewandoski, M., Nature Reviews Genetics 2,
743-755 (2001) and articles referenced therein, all of which are incorporated herein by
reference, reviewing methods for achieving conditional expression in mice. In
general, binary transgenic systems fall into two categories. In the first type of system,
the effector transactivates transcription of the target transgene. For example, in the
tetracycline-dependent regulatory systems (Gossen, M. & Bujard, H., Proc. Natl Acad.
Sci. USA 89, 5547-5551 (1992)), the effector is a fusion of sequences that encode the
VP16 transactivation domain and the Escherichia coli tetracycline repressor (TetR)
protein, which specifically binds both tetracycline and the 19-bp operator sequences
(tetO) of the tet operon in the target transgene, resulting in its transcription. In the
original system, the tetracycline-controlled transactivator (tTA) cannot bind DNA
when the inducer is present, while in a modified version, the 'reverse tTA' (rtTA)
binds DNA only when the inducer is present ('tet-on') (Gossen, M. et al., Science 268,
1766-1769 (1995)). The current inducer of choice is doxycycline (Dox). The
invention therefore provides lentiviral transfer plasmids as described above
comprising a tetracycline-controlled transactivator or reverse tetracycline-controlled
transactivator, lentiviral transfer plasmids comprising operator sequences of the tet
operon to which the tetracycline-controlled transactivator or reverse
tetracycline-controlled transactivator specifically bind, and methods of using the transfer plasmids
and lentiviral particles derived therefrom to achieve conditional expression, including
the generation of transgenic animals in which conditional expression is achieved.
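
The on/off behavior of the two tetracycline-dependent effectors described above reduces to a small truth table, sketched below. The function name is hypothetical, the label "tet-off" for the original tTA system is common usage rather than a term from this text, and real systems show graded rather than all-or-nothing responses.

```python
def target_transcribed(effector, dox_present):
    """Idealized model: the target transgene is transcribed only when the
    transactivator can bind the tetO operator sequences."""
    if effector == "tTA":    # original system ("tet-off"): inducer blocks DNA binding
        return not dox_present
    if effector == "rtTA":   # reverse tTA ("tet-on"): inducer is required for binding
        return dox_present
    raise ValueError(f"unknown effector: {effector!r}")

# Enumerate all four combinations of effector and inducer status.
truth_table = {
    (effector, dox): target_transcribed(effector, dox)
    for effector in ("tTA", "rtTA")
    for dox in (False, True)
}
```

The table makes the design choice explicit: tTA suits cases where expression should be on by default and switched off with doxycycline, while rtTA suits cases where expression should remain off until the inducer is supplied.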



[00117] In the second type of system, the effector is a site-specific DNA 
recombinase that rearranges the target gene, thereby activating or silencing it. These 
systems are described below. In order to achieve conditional expression in cells or 
tissues having a particular physiological, biological, or disease state, a promoter that 
is selectively active in cells or tissue having that particular physiological, biological,
or disease state may be used. 

[00118] As described further below, one application for the lentiviral transfer
plasmids and lentiviral expression systems of the invention is to direct transcription of
RNAs that hybridize or self-hybridize to form siRNAs or shRNAs in cells, e.g.,
mammalian cells. For these purposes in certain embodiments of the invention it is
preferred to use a Pol III promoter such as the U6 or H1 promoter. Therefore, the
invention provides lentiviral transfer plasmids and lentiviral particles optimized for
siRNA, i.e., lentiviral transfer plasmids and lentiviral particles comprising a Pol III
promoter, e.g., the U6 or H1 promoter. According to certain embodiments of the
invention the Pol III promoter is inducible. It is noted that Pol II promoters can also
be used to achieve intracellular expression of siRNA or shRNA (Xia, H., et al., Nat.
Biotech., 20: 1006-1010, 2002), and the lentiviral vectors described herein may be
used in this manner.

[00119] Transfer plasmid size. As described in further detail in Example 1, by 
removing certain dispensable sequences the inventors have created lentiviral transfer
plasmids having reduced size relative to previously known lentiviral transfer 
plasmids, which results in a number of advantages. First, the reduced size of the 
transfer constructs adds to their ease of manipulability. Second, the reduced size adds 
to their flexibility. As is known in the art, there is a limit to the size of retroviral 
genomes that can be efficiently packaged. Generally it is preferable to limit the size
of the transcript for packaging (distance between 5' and 3' UTRs) to less than 
approximately 8-10 kB. Thus removal of the dispensable sequences allows the 
insertion of larger heterologous sequence(s) without compromising the ability of the 
resulting genomic transcript to be packaged during the production of lentiviral 
particles. As used herein in reference to retroviral and lentiviral vectors, a "genome"
or "genomic transcript" generally refers to a transcript that contains sufficient 
packaging signals to allow packaging. It does not imply that the transcript need
contain all or even most of the genetic information found in a wild type virus. In
general, the sequence of a genomic transcript will depend on the location of the 
promoter upstream of the packaging sequence and the location of the polyadenylation 
site downstream of the packaging sequence. 

[00120] The invention provides a lentiviral transfer plasmid having a length less
than 10 kilobases (kB). The invention provides a lentiviral transfer plasmid having a
length less than 9 kB. The invention provides a lentiviral transfer plasmid having a
length less than 8 kB. The invention provides a lentiviral transfer plasmid having a
length less than 7 kB. The invention provides a lentiviral transfer plasmid having a
length less than 6.5 kB. The invention provides a lentiviral transfer plasmid having a
length of approximately 6 kB. Generally, unless otherwise evident from the context,
the term "approximately" means that the value may deviate by 10% or less from the
numeral given, and the ranges listed are assumed to include both endpoints. The
invention further provides collections of lentiviral plasmids having a length less than
10 kB, a length less than 9 kB, a length less than 8 kB, or a length less than 7 kB.
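
The definition of "approximately" given above (deviation of 10% or less, endpoints included) and the listed size limits can be expressed as a short sketch. Lengths are taken in base pairs with 1 kB assumed equal to 1,000 bp, integer arithmetic is used to avoid floating-point edge effects at the endpoints, and the function names are hypothetical.

```python
def is_approximately(value_bp, nominal_bp, tolerance_percent=10):
    """True if value deviates from nominal by `tolerance_percent` or less.

    Integer arithmetic keeps the endpoints (exactly 10% deviation) inclusive.
    """
    return abs(value_bp - nominal_bp) * 100 <= tolerance_percent * nominal_bp

# Size limits listed in the paragraph above, expressed in base pairs.
SIZE_LIMITS_BP = (10_000, 9_000, 8_000, 7_000, 6_500)

def size_limits_met(length_bp):
    """Return the listed limits that a plasmid of this length falls below."""
    return [limit for limit in SIZE_LIMITS_BP if length_bp < limit]
```

Under this reading, a plasmid of 6,600 bp would still count as "approximately 6 kB", while one of 6,601 bp would not; a 7,500 bp plasmid would satisfy only the 10, 9, and 8 kB limits.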

[00121] In particular, the invention provides a lentiviral transfer plasmid having a 
length less than 8 kB and comprising one or more heterologous nucleic acid 
sequences. According to certain embodiments of the invention the heterologous 
nucleic acid sequence is a promoter or promoter/enhancer such as the CMV promoter,
the CMV promoter/enhancer, or the Ubiquitin C promoter. According to certain
embodiments of the invention the promoter is the U6 or H1 promoter. According to
certain embodiments of the invention the heterologous nucleic acid sequence is a 
reporter gene, e.g., a gene encoding EGFP or dsRed2. The invention particularly 
provides a lentiviral transfer plasmid having a length of approximately 6.0 kB
comprising at least one MCS, two LoxP sites, an HIV FLAP element, and a WRE.

[00122] Transfer plasmid sequence information. The inventors have recognized
that prior art lentiviral vector systems suffered from a dearth of sequence information. 
As will be readily appreciated by one of ordinary skill in the art, regardless of the 
particular nature of a transfer plasmid, it is desirable to have complete and accurate
sequence information. Such information makes it possible, for example, to readily
determine the identity of all restriction sites, to design primers for amplification of 
particular plasmid sequences or for other purposes such as the introduction of
mutations, etc. In addition, the availability of complete sequence information makes
it possible to identify determinants of plasmid function, e.g., by engineering mutations 
at specific sites and observing the effect on, for example, packaging, integration, 
transcription, etc. Accordingly, the invention provides a fully sequenced lentiviral 
transfer plasmid, wherein the sequence is deposited in a publicly accessible database.
By "fully sequenced" is meant that the complete nucleotide sequence of the plasmid is 
known. By "publicly accessible database" is meant Genbank, or any other database 
that can be accessed by the public without requiring a fee. In particular, the invention 
provides a fully sequenced lentiviral transfer plasmid comprising the sequence set 

10 forth in any of the following SEQ ID NOS: SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID 
NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID 
NO: 9. In addition, the invention provides a collection of lentiviral transfer plasmids 
including at least two of the plasmids having SEQ ID NOS: SEQ ID NO: 2, SEQ ID 
NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 

15 8, or SEQ ID NO: 9. In addition, the invention provides a lentiviral transfer plasmid 
having a sequence that differs by not more than 100 nucleotides from the sequence set 
forth in SEQ ID NOS: SEQ ID NO: 2, SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, 
SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, or SEQ ID NO: 9. The invention 
further provides a lentiviral transfer plasmid having a sequence that differs by not 

20 more than X nucleotides from the sequence set forth in SEQ ID NOS: SEQ ID NO: 2, 
SEQ ID NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ 
ID NO: 8, or SEQ ID NO: 9, where X represents any number between 1 and 99, 
inclusive. By "a sequence that differs by not more than X nucleotides (where X is 
any number) from the sequence of SEQ ID NO: Y" is meant any sequence that can be
obtained from SEQ ID NO: Y by inserting, deleting, and/or altering not more than X
nucleotides of SEQ ID NO: Y.
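
By way of illustration only, the definition above can be read as a bound on the minimum number of single-nucleotide insertions, deletions, and alterations separating two sequences. The following Python sketch assumes that this count is the Levenshtein edit distance; the function names edit_distance and differs_by_at_most are hypothetical, and the short sequences in the usage note are made up and are not any of the SEQ ID NOS.

```python
def edit_distance(a: str, b: str) -> int:
    """Minimum number of single-nucleotide insertions, deletions, or
    substitutions needed to turn sequence a into sequence b."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ca in enumerate(a, start=1):
        curr = [i]  # turning a[:i] into the empty string takes i deletions
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # keep or substitute
        prev = curr
    return prev[-1]


def differs_by_at_most(candidate: str, reference: str, x: int) -> bool:
    """True if candidate differs from reference by not more than x nucleotides
    (assumption: 'differs' is measured as Levenshtein distance, case-insensitive)."""
    return edit_distance(candidate.upper(), reference.upper()) <= x
```

For example, differs_by_at_most("ACGT", "ACCT", 1) holds because a single substitution suffices. The simple dynamic-programming table used here takes time proportional to the product of the two sequence lengths, so a banded or bounded variant may be preferable for full-length plasmid sequences.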

[00123] Recombination sites for site-specific recombinase. According to certain 
embodiments of the invention the transfer plasmid includes at least one (typically 
two) site(s) for recombination mediated by a site-specific recombinase. Site-specific 
recombinases catalyze the introduction or excision of DNA fragments from a longer
DNA molecule. These enzymes recognize a relatively short, unique nucleic acid 
sequence, which serves for both recognition and recombination. Typically the
recombination site is composed of short inverted repeats (6, 7 or 8 base pairs in
length) and the DNA-binding element is typically 11-13 bp in length.
[00124] In general, the transfer plasmids may contain one or more recombination 
sites for any of a wide variety of site-specific recombinases. As mentioned above, it 
is to be understood that the target site for a site-specific recombinase is in addition to
any site(s) required for integration of the lentiviral genome. According to various 
embodiments of the invention the transfer plasmid includes one or more sites for a 
recombinase enzyme selected from the group consisting of Cre, XerD, HP1 and Flp. 
These enzymes and their recombination sites are well known in the art. See, for
example, Sauer, B. & Henderson, N., Nucleic Acids Res. 17, 147-161 (1989);
Gorman, C. and Bullock, C., Curr. Op. Biotechnol., 11(5): 455-460, 2000;
O'Gorman, S., Fox, D. T. & Wahl, G. M., Science 251, 1351-1355 (1991); Kolb,
A., Cloning Stem Cells, 4(1):65-80, 2002; and U.S. Patent 4,959,317. See also Kuhn,
R., and Torres, RM, Methods Mol Biol 2002;180:175-204.

[00125] Cre and Flp catalyse a conservative DNA recombination event
between two 34-bp recognition sites (loxP and FRT, respectively). Placing a
heterologous nucleic acid sequence operably linked to a promoter element between 
two loxP sites (in which case the sequence is "floxed") allows for controlled 
expression of the heterologous sequence following transfer into a cell. By inducing
expression of Cre within the cell, the heterologous nucleic acid sequence is excised,
thus preventing further transcription and effectively eliminating expression of the 
sequence. This system has a number of applications including Cre-mediated gene 
activation (in which either heterologous or endogenous genes may be activated, e.g., 
by removal of an inhibitory element or a polyadenylation site), creation of transgenic
animals exhibiting temporal control of Cre expression, cell-lineage analysis in
transgenic animals, and generation of tissue-specific knockouts or knockdowns in
transgenic animals.
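
The excision of a floxed sequence can be pictured with a simple string model. The following Python sketch is illustrative only: it uses the commonly cited wild-type 34-bp loxP sequence (two 13-bp inverted repeats flanking an 8-bp spacer), considers a single strand, and assumes that both loxP sites are in the same orientation, in which case recombination removes the intervening segment and leaves one loxP site behind. The function name cre_excise and the flanking sequences in the usage note are hypothetical.

```python
# Wild-type loxP: 13-bp inverted repeat + 8-bp asymmetric spacer + 13-bp inverted repeat
LOXP = "ATAACTTCGTATA" + "GCATACAT" + "TATACGAAGTTAT"


def cre_excise(dna: str) -> str:
    """Model Cre-mediated excision between the first two same-orientation
    loxP sites on the given strand. One loxP site is retained. If fewer than
    two sites are found, the sequence is returned unchanged."""
    first = dna.find(LOXP)
    if first == -1:
        return dna
    second = dna.find(LOXP, first + len(LOXP))
    if second == -1:
        return dna
    # Keep everything up to and including the first site, then resume
    # immediately after the second site.
    return dna[:first + len(LOXP)] + dna[second + len(LOXP):]
```

For instance, applying cre_excise to "TTGACA" + LOXP + "ATGAAACCCGGGTAA" + LOXP + "AATAAA" yields "TTGACA" + LOXP + "AATAAA". Sites in opposite orientation, which lead to inversion rather than excision, are deliberately not modeled here.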

[00126] According to certain embodiments of the invention the transfer plasmid 
includes two loxP sites. Furthermore, in preferred embodiments of the invention the 
transfer plasmid includes a cloning site, e.g., a unique restriction site, between the two
loxP sites, which allows the convenient insertion of a heterologous nucleic acid 
sequence. According to certain embodiments of the invention the transfer plasmid
includes a MCS between the two loxP sites. According to certain embodiments of the
invention the two loxP sites are located between an HIV FLAP element and a WRE. 
According to certain embodiments of the invention the plasmid contains a unique 
restriction site between the 3' loxP site and the WRE. 
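
As a hedged illustration of what "a unique restriction site between the two loxP sites" requires, the following Python sketch checks that a recognition sequence occurs exactly once in a plasmid sequence (on either strand) and that this single occurrence lies between two loxP sites. The function names are hypothetical, the plasmid is treated as a linear string (so a site spanning the arbitrary start of a circular sequence would be missed), and EcoRI (GAATTC) in the usage note is merely an example enzyme, not one specified by the text.

```python
LOXP = "ATAACTTCGTATA" + "GCATACAT" + "TATACGAAGTTAT"  # wild-type 34-bp loxP


def find_all(seq: str, motif: str) -> list[int]:
    """Start positions of every (possibly overlapping) occurrence of motif."""
    positions = []
    start = seq.find(motif)
    while start != -1:
        positions.append(start)
        start = seq.find(motif, start + 1)
    return positions


def reverse_complement(seq: str) -> str:
    return seq.translate(str.maketrans("ACGT", "TGCA"))[::-1]


def is_unique_site_between_loxp(plasmid: str, site: str) -> bool:
    """True if the plasmid has exactly two loxP sites on this strand and the
    recognition sequence occurs exactly once, entirely between them."""
    lox = find_all(plasmid, LOXP)
    if len(lox) != 2:
        return False
    # A set merges identical positions reported for palindromic sites.
    hits = set(find_all(plasmid, site)) | set(find_all(plasmid, reverse_complement(site)))
    if len(hits) != 1:
        return False
    pos = next(iter(hits))
    return lox[0] + len(LOXP) <= pos and pos + len(site) <= lox[1]
```

With a toy sequence such as "CCCC" + LOXP + "AAGAATTCAA" + LOXP + "GGGG", the check succeeds for "GAATTC"; it fails if a second GAATTC is appended outside the floxed region.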
[00127] As described above, positioning the heterologous nucleic acid sequence
between loxP sites allows for controlled expression of the heterologous sequence 
following transfer into a cell. By inducing Cre expression within the cell, the 
heterologous nucleic acid sequence is excised, thus preventing further transcription 
and effectively eliminating expression of the sequence. Cre expression may be
induced in any of a variety of ways. For example, Cre may be present in the cells
under control of an inducible promoter, and Cre expression may be induced by
activating the promoter. Alternately, Cre expression may be induced by introducing
an expression vector that directs expression of Cre into the cell. Any suitable
expression vector can be used, including, but not limited to, viral vectors such as
adenoviral vectors. (The phrase "inducing Cre expression" as used herein refers to
any process that results in an increased level of Cre within a cell.) 
[00128] The invention thus provides a method for achieving controlled expression 
of a heterologous nucleic acid sequence comprising steps of inserting the 
heterologous nucleic acid sequence into a transfer plasmid of the invention between
sites for a recombinase, thereby producing a modified transfer plasmid; introducing
the modified transfer plasmid or a portion thereof including at least the sites for the
recombinase and the region between the sites into a cell; and subsequently inducing
expression of the recombinase within the cell. According to certain embodiments of 
the invention the cell is a mammalian cell. According to certain embodiments of the
invention the recombinase is Cre and the sites for the recombinase are loxP sites. In
accordance with the invention the transfer plasmid may be introduced into the cell 
using standard techniques such as transfection. Alternately, the transfer plasmid may 
be used to generate a lentiviral particle that includes a lentiviral genome comprising 
the recombinase sites and the region between them. As described elsewhere herein,
the genome integrates into the cell's DNA and directs expression of the heterologous
nucleic acid sequence. The cell may be used for any of a variety of purposes as 
described in more detail below.
[00129] The lentiviral transfer plasmids comprising two loxP sites are useful in any
applications for which standard vectors comprising two loxP sites can be used. For 
example, selectable markers may be placed between the loxP sites. This allows for 
sequential and repeated targeting of multiple genes to a single cell (or its progeny). 
After introduction of a transfer plasmid comprising a floxed selectable marker into a
cell, stable transfectants may be selected. After isolation of a stable transfectant, the 
marker can be excised by induction of Cre. The marker may then be used to target a 
second gene to the cell or its progeny. Lentiviral particles comprising a lentiviral 
genome derived from the transfer plasmids may be used in the same manner. 

[00130] As another example, standard gene-targeting techniques may be used to
produce a mouse in which an essential region of a gene of interest is floxed, so that 
tissue-specific Cre expression results in the inactivation of this allele. The transfer 
plasmids may be introduced into cells (e.g., ES cells) using pronuclear injection. 
Alternately, the cells may be injected or infected with lentiviral particles comprising a
lentiviral genome derived from the transfer plasmid. Tissue-specific Cre expression
may be achieved by crossing a mouse line with a conditional allele (i.e., a floxed
nucleic acid sequence) to an effector mouse line that expresses Cre in a tissue-specific
manner, so that progeny are produced in which the conditional allele is inactivated 
only in those tissues or cells that express Cre. Suitable transgenic lines are known in
the art and may be found, for example, in the Cre Transgenic Database at the Web site
having URL www.mshri.on.ca/nagy/Cre-pub.html. 

[00131] Internal ribosome entry sequence (IRES). The transfer plasmids may also 
include an IRES. IRES elements function as initiators of the efficient translation of 
reading frames. An IRES allows ribosomes to start the translation process anew with
whatever is immediately downstream, regardless of whatever was upstream. In
particular, an IRES allows for the translation of two different genes on a single 
transcript. For example, an IRES allows the expression of a marker such as EGFP off 
the same transcript as a transgene, which has a number of advantages: (1) The 
transgene is native and does not have any fused open reading frames that might affect
function; (2) Since the EGFP is from the same transcript its levels should be an
accurate representation of the levels of the upstream transgene. IRES elements are 
known in the art and are further described in Kim, et al., Molecular and Cellular
Biology 12(8):3636-3643 (August 1992) and McBratney, et al., Current Opinion in
Cell Biology 5:961-965 (1993). 

[00132] Any of a wide variety of sequences of viral, cellular, or synthetic origin 
which mediate internal binding of the ribosomes can be used as an IRES. Examples
include those IRES elements from poliovirus Type I, the 5'UTR of
encephalomyocarditis virus (EMV), of Theiler's murine encephalomyelitis virus
(TMEV), of foot and mouth disease virus (FMDV), of bovine enterovirus (BEV), of
coxsackie B virus (CBV), or of human rhinovirus (HRV), or the human
immunoglobulin heavy chain binding protein (BIP) 5'UTR, the Drosophila
antennapediae 5'UTR or the Drosophila ultrabithorax 5'UTR, or genetic hybrids or
fragments from the above-listed sequences.

[00133] Transfer plasmids incorporating heterologous nucleic acids. The invention
provides new lentiviral transfer constructs incorporating a variety of heterologous 
nucleic acids (also referred to as heterologous sequences or heterologous nucleic acid
segments), preferably operably linked to a promoter or promoter/enhancer element.
These sequences may be inserted at any available site within the transfer plasmid
including, but not limited to, at a restriction site within a MCS. In general, the 
inserted nucleotide sequence may be any nucleotide sequence and may be a naturally 
occurring sequence or variant thereof or an artificial sequence. Heterologous gene
sequences of the present invention may comprise one or more gene sequences that
already possess one or more regulatory elements such as promoters, initiation
sequences, processing sequences, etc. Alternatively, such regulatory elements may be
present within the transfer plasmid prior to insertion of the heterologous sequence.
[00134] According to certain embodiments of the invention the inserted
heterologous sequence is a reporter gene sequence. A reporter gene sequence, as used
herein, is any gene sequence which, when expressed, results in the production of a 
protein whose presence or activity can be monitored. Suitable reporter gene 
sequences include, but are not limited to, sequences encoding chemiluminescent or 
fluorescent proteins such as green fluorescent protein (GFP) and variants thereof such
as enhanced green fluorescent protein (EGFP); cyan fluorescent protein; yellow
fluorescent protein; blue fluorescent protein; dsRed or dsRed2; luciferase; aequorin;
etc. Many of these markers and their uses are reviewed in van Roessel, P. and Brand,
A., Nature Cell Biology, 4(1), E15-20, 2002, and references therein, all of which are
incorporated herein by reference. Additional examples of suitable reporter genes 
include the gene for galactokinase, beta-galactosidase, chloramphenicol 
acetyltransferase, beta-lactamase, etc. Alternatively, the reporter gene sequence may 
be any gene sequence whose expression produces a gene product which affects cell
physiology or phenotype. In general, a reporter gene sequence typically encodes a 
protein that is not normally present within a cell into which the transfer plasmid is to 
be introduced. 

[00135] According to certain embodiments of the invention the inserted
heterologous sequence is a selectable marker gene sequence, which term is used
herein to refer to any gene sequence capable of expressing a protein whose presence
permits the selective maintenance and/or propagation of a cell which contains it. 
Examples of selectable marker genes include gene sequences capable of conferring 
host resistance to antibiotics (e.g., puromycin, ampicillin, tetracycline, kanamycin, 
and the like), or of conferring host resistance to amino acid analogues, or of
permitting the growth of cells on additional carbon sources or under otherwise 
impermissible culture conditions. A gene sequence may be both a reporter gene and a 
selectable marker gene sequence. In general, preferred reporter or selectable marker 
gene sequences are sufficient to permit the recognition or selection of the plasmid in
normal cells.

[00136] The heterologous sequence may also comprise the coding sequence of a 
desired product such as a biologically active protein or polypeptide (e.g., a 
therapeutically active protein or polypeptide) and/or an immunogenic or antigenic 
protein or polypeptide. Introduction of the transfer plasmid into a suitable cell thus
results in expression of the protein or polypeptide by the cell. Alternatively, the
heterologous gene sequence may comprise a nucleic acid segment that provides a 
template for transcription of an antisense RNA, a ribozyme, or, preferably, one or 
more strands of a short interfering RNA (siRNA) or a precursor thereof such as a 
short hairpin RNA (shRNA). As described further below, siRNAs and shRNAs
targeted to cellular transcripts inhibit expression of such transcripts. Introduction of
the transfer plasmid into a suitable cell thus results in production of the siRNA or 
shRNA, which inhibits expression of the target transcript.
[00137] Three and four plasmid lentiviral expression systems. The invention
further provides a recombinant lentiviral expression system comprising three 
plasmids. The first plasmid is constructed to contain mutations that prevent 
lentivirus-mediated transfer of viral genes. Such mutations may be a deletion of 
sequences in the viral env gene, thus preventing the generation of replication-
competent lentivirus, or may be deletions of certain cis-acting sequence elements at
the 3' end of the genome required for viral reverse transcription and integration. Thus
even if viral genes from this construct are packaged into viral particles, they will not 
be replicated and replication-competent wild-type viruses will not be produced. The
first plasmid (packaging plasmid) comprises a nucleic acid sequence of at least part of
a lentiviral genome, wherein the vector (i) contains at least one defect in at least one 
gene encoding a lentiviral structural protein, and (ii) lacks a functional packaging 
signal. The second plasmid (Env-coding plasmid) comprises a nucleic acid sequence 
of a virus, wherein the vector (i) expresses a viral envelope protein, and (ii) lacks a
functional packaging signal. The third plasmid may be any of the inventive transfer
plasmids described above. The first and second plasmids are further described below, 
and schematic diagrams of relevant portions of representative first and second 
plasmids (packaging and Env-coding) are presented in Figure 10A, which is taken 
from reference 21. The third plasmid (not shown) is a transfer plasmid.

[00138] Packaging plasmid. In certain embodiments of the invention the first
vector is a gag/pol expression vector, i.e., a plasmid capable of directing expression of
functional forms of a retroviral gag gene product and a retroviral Pol gene product. 
These proteins are necessary for assembly and release of viral particles from cells. 
The first plasmid may also express sequences encoding various accessory lentiviral
proteins including, but not limited to, Vif, Vpr, Vpu, Tat, Rev, and Nef. In particular,
the first plasmid may express a sequence encoding Rev. In general, the gag and pol 
sequences may be derived from any retrovirus, and the accessory sequences may be 
derived from any lentivirus. According to certain embodiments of the invention the 
gag and pol sequences and any accessory sequences are derived from HIV-1. It is
noted that the gag, pol, and accessory protein sequences need not be identical to wild
type versions but instead may contain mutations, deletions, etc., that do not
significantly impair the ability of the protein to perform its function in the viral life
cycle. 

[00139] The first plasmid is preferably constructed to contain mutations that 
exclude retroviral-mediated transfer of viral genes. Such mutations may be a deletion
or mutation of sequences in the viral env gene, thus excluding the possibility of
generating replication-competent lentivirus. Alternatively, or in addition to deletion
or mutation of env, according to certain embodiments of the invention the plasmid
sequence may contain deletions of certain cis-acting sequence elements at the 3' end
of the genome required for viral reverse transcription and integration. Accordingly,
even if viral genes from this construct are packaged into viral particles, they will not
be replicated and replication-competent wild-type viruses will not be generated. Any
of a wide variety of packaging plasmids may be used in the three plasmid lentiviral
expression system of the invention including, but not limited to, those described in 
references 21, 24, 29, and 40. 

[00140] Env-coding plasmid. This plasmid directs expression of a viral envelope
protein and, therefore, comprises a nucleic acid sequence encoding a viral envelope 
protein under the control of a suitable promoter. The promoter can be any promoter 
capable of directing transcription in cells into which the plasmid is to be introduced. 
One of ordinary skill in the art will readily be able to select an appropriate promoter
among, for example, the promoters mentioned above. For example, according to
certain embodiments of the invention a CMV promoter is used. The Env-coding
plasmid preferably contains any additional sequences needed for efficient 
transcription, processing, etc., of the env transcript including, but not limited to, a 
polyadenylation signal such as any of those mentioned above. 

[00141] The host range of cells that the viral vectors of the present invention can
infect may be altered (e.g., broadened) by utilizing an envelope gene from a different
virus. Thus it is possible to alter or increase the host range of the vectors of the present
invention by taking advantage of the ability of the envelope proteins of certain viruses 
to participate in the encapsidation of other viruses. In a preferred embodiment of the
present invention, the G-protein of vesicular-stomatitis virus (VSV-G; see, e.g., Rose
and Gallione, J. Virol. 39, 519-528 (1981); Rose and Bergmann, Cell 30, 753-762
(1982)), or a fragment or derivative thereof, is the envelope protein expressed by the
second plasmid. VSV-G efficiently forms pseudotyped virions with genome and
matrix components of other viruses. As used herein, the term "pseudotype" refers to a 
viral particle that contains nucleic acid of one virus but the envelope protein of 
another virus. In general, VSV-G pseudotyped viruses have a very broad host range, 
and may be concentrated to high titers by ultracentrifugation (e.g.,
according to the method of J. C. Burns, et al., Proc. Natl. Acad. Sci. USA 90, 8033-8037
(1993)), while still retaining high levels of infectivity.
[00142] Additional envelope proteins that may be used in accordance with the 
present invention include, but are not limited to, ecotropic or amphotropic MLV
envelopes, 10A1 envelope, truncated forms of the HIV env, GALV, BAEV, SIV,
FeLV-B, RD114, SSAV, Ebola, Sendai, FPV (Fowl plague virus), and influenza virus
envelopes. Similarly, genes encoding envelopes from RNA viruses (e.g., RNA virus
families of Picornaviridae, Caliciviridae, Astroviridae, Togaviridae, Flaviviridae,
Coronaviridae, Paramyxoviridae, Rhabdoviridae, Filoviridae, Orthomyxoviridae,
Bunyaviridae, Arenaviridae, Reoviridae, Birnaviridae, Retroviridae) as well as from
the DNA viruses (families of Hepadnaviridae, Circoviridae, Parvoviridae,
Papovaviridae, Adenoviridae, Herpesviridae, Poxviridae, and Iridoviridae) may be
utilized. Representative examples include FIV, FeLV, RSV, VEE, HFVW, WDSV,
SFV, Rabies, ALV, BIV, BLV, EBV, CAEV, HTLV, SNV, ChTLV, STLV, MPMV,
SMRV, RAV, FuSV, MH2, AEV, AMV, CT10, EIAV. In addition to the above,
hybrid envelopes (e.g., envelopes comprising regions of more than one of the above)
may be employed. According to certain embodiments of the invention the envelope
recognizes a unique cellular receptor (e.g., a receptor found only on a specific cell 
type or in a specific species), while according to certain other embodiments of the
invention the envelope recognizes multiple different receptors. According to certain
embodiments of the invention the second plasmid encodes a cell or tissue specific 
targeting envelope. Cell or tissue specific targeting may be achieved, for example, by 
incorporating particular sequences within the envelope sequence (e.g., sequences 
encoding ligands for cell or tissue-specific receptors, antibody sequences, etc.). Thus
any of a wide variety of Env-coding plasmids may be used in the three plasmid
lentiviral expression system of the invention including, but not limited to, those
described in references 21, 24, 29, and 40.
[00143] Variations on the three plasmid system. The invention further provides a
four plasmid lentiviral expression system comprising a three plasmid lentiviral 
expression system as described herein and a fourth plasmid comprising a nucleic acid 
sequence encoding the Rev protein (in which case the rev gene is generally not
included in the other plasmids). As mentioned above, the presence of Rev increases
the level of transcription during production of lentiviral particles. It will be 
appreciated that a variety of alternative three or four plasmid systems may be 
employed while maintaining the feature that no sequence of recombination event(s)
between only two of the three or four plasmids is sufficient to generate replication-
competent virus. For example, either Gag or Pol or any of the accessory proteins may
be encoded by the plasmid referred to as the Env-coding plasmid. Alternately, Gag, 
Pol, or any of the accessory proteins may be encoded by the transfer plasmid. In 
addition, sequences encoding Rev may be provided on the same plasmid that encodes 
Gag, Pol, or Env. According to certain embodiments of the invention sequences
encoding a functional Tat protein are absent from the plasmids, and sequences
encoding Rev are provided on a separate plasmid rather than on the same plasmid as
sequences encoding other viral genes, as described in reference 40. Schematic 
diagrams of relevant portions of representative first and second plasmids (packaging 
and Env-coding) and fourth plasmid encoding Rev are presented in Figure 10B, which
is taken from reference 40. The third plasmid (not shown) is a transfer plasmid.
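
The design feature that no recombination between only two plasmids suffices to generate replication-competent virus can be checked against a deliberately simplified model. In the following Python sketch each plasmid is represented as a set of functional elements, and a pair of plasmids is flagged if together they carry every element on a required list. The required list (gag, pol, env, a packaging signal "psi", and an LTR), the element assignments in the usage note, and the function name risky_pairs are all assumptions made for illustration; real safety assessment depends on sequence homology and other factors that this toy model ignores.

```python
from itertools import combinations

# Hypothetical minimal element set for a replication-competent genome (toy model).
REQUIRED = frozenset({"gag", "pol", "env", "psi", "ltr"})


def risky_pairs(plasmids: dict[str, set[str]]) -> list[tuple[str, str]]:
    """Return every pair of plasmids whose combined elements cover REQUIRED,
    i.e. pairs for which two-plasmid recombination could, in this toy model,
    reassemble all required elements."""
    return [
        (name_a, name_b)
        for (name_a, elems_a), (name_b, elems_b) in combinations(plasmids.items(), 2)
        if REQUIRED <= (elems_a | elems_b)
    ]
```

Under these assumptions, a three plasmid arrangement such as {"packaging": {"gag", "pol", "rev"}, "env_coding": {"env"}, "transfer": {"psi", "ltr", "transgene"}} produces no risky pairs, whereas a two plasmid arrangement in which one helper carries gag, pol, and env is flagged.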
[00144] Applications of the lentiviral transfer plasmids and expression systems. 
The lentiviral transfer plasmids and lentiviral expression systems of the invention 
have a wide variety of uses, some of which have been described above. As will be 
evident, the transfer plasmids may be used for any application in which a
conventional expression plasmid is employed. As described in Examples 3 through 6,
the transfer plasmids of the invention are able to drive expression of heterologous 
genes (e.g., EGFP) when transfected into cells and are also able to drive synthesis of 
shRNA when transfected into cells. 

[00145] The presence of one or more MCSs means that the plasmids may
conveniently be used for insertion and subsequent expression of any heterologous
sequence. In particular, the transfer plasmids that include an insertion site such as an 
MCS between sites for a recombinase such as loxP may be used for easy assembly of
a promoter-site-sequence-site cassette (where "site" indicates a recombination site for
a recombinase and "sequence" indicates a heterologous sequence of interest), e.g., a
promoter-loxP-sequence-loxP cassette that can then be moved into another vector. It is
noted that the transfer plasmids can be used to direct expression of a heterologous
nucleic acid in a variety of eukaryotic cells other than mammalian cells, provided a
promoter capable of directing transcription in such cells is employed. Thus references 
to "mammalian cells" herein should not be understood to exclude non-mammalian 
cells, as long as an appropriate promoter for transcription in non-mammalian cells is 
provided. 

[00146] Introducing plasmids into cells. In general, the plasmids described herein
may be introduced into cells via conventional transformation or transfection 
techniques. As used herein, the terms "transformation" and "transfection" are 
intended to refer to a variety of art-recognized techniques for introducing foreign 
nucleic acid (e.g., DNA or RNA) into cells, including calcium phosphate or calcium
chloride co-precipitation, DEAE-dextran-mediated transfection, lipofection, injection,
or electroporation. 

[00147] Production of replication-defective lentiviral particles. In general, the 
transfer plasmids and the three-plasmid recombinant lentiviral expression systems of 
the invention may be used to produce infectious, replication-defective lentiviral
particles according to methods known to those skilled in the art. In the case of the
recombinant lentiviral expression system of the invention the methods include (i) 
transfecting a lentivirus-permissive cell with the three-plasmid lentiviral expression 
system of the present invention; (ii) producing the lentivirus-derived particles in the 
transfected cell; and (iii) collecting the virus particles from the cell. The step of
transfecting the lentivirus-permissive cell can be carried out according to any suitable
means known to those skilled in the art. For example, the three-plasmid expression 
system described herein may be used to generate lentivirus-derived retroviral vector 
particles by transient transfection. The plasmids may be introduced into cells by any 
suitable means, including, but not limited to, calcium phosphate or calcium chloride
co-precipitation, DEAE-dextran-mediated transfection, lipofection, injection, or
electroporation. 






[00148] The transfer plasmids of the invention may be used to produce infectious, 
replication-defective lentiviral particles in a similar manner using helper cells that 
express the necessary viral proteins as known in the art and mentioned above. In 
general, the transfer plasmids may be used to produce infectious, replication-defective 
lentiviral particles in conjunction with any system using any combination of plasmids
and/or helper cell lines that provides the appropriate combination of required genes: 
gag, pol, env and, preferably, rev in cases where transcription occurs from a gag/pol
expression cassette containing a Rev-response element (or alternately a system that 
supplies the various proteins encoded by these genes). 

[00149] Infectious virus particles may be collected using conventional techniques.
For example, the infectious particles may be collected by cell lysis, or collection of 
the supernatant of the cell culture, as is known in the art. Optionally, the collected 
virus particles may be purified if desired. Suitable purification techniques are well 
known to those skilled in the art. Methods for titering virus particles are also well
known in the art. Further details are provided in the Examples.

[00150] Producer cell lines. As will be evident, when a host cell permissive for 
production of lentiviral particles is transfected with the plasmids of the three-plasmid 
system, the cell becomes a producer cell, i.e., a cell that produces infectious lentiviral 
particles. Similarly, when a helper cell that produces the necessary viral proteins is
transfected with a transfer plasmid of the invention, the cell becomes a producer cell.
The invention therefore provides producer cells and corresponding producer cell lines 
and methods for the production of such cells and cell lines. In particular, the 
invention provides a method of creating a producer cell line comprising introducing a 
transfer plasmid of the invention into a host cell; and introducing a packaging plasmid
and an envelope plasmid into the host cell. The invention provides another method of
creating a producer cell line comprising introducing a transfer plasmid of the 
invention into a helper cell that produces viral proteins necessary for encapsidation of 
a lentiviral genome and subsequent infectivity of a lentiviral particle resulting from 
encapsidation. 

[00151] The inclusion of appropriate genetic elements from various papovaviruses
allows plasmids to be maintained as episomes within mammalian cells. Such 
plasmids are faithfully distributed to daughter cells. In particular, viral elements of
various polyomaviruses and papillomaviruses such as BK virus (BKV), bovine
papilloma virus 1 (BPV-1) and Epstein-Barr virus (EBV), among others, are useful in
this regard. The invention therefore provides lentiviral transfer plasmids comprising a
viral element sufficient for stable maintenance of the transfer plasmid as an episome
within mammalian cells. Appropriate genetic elements and their use are described,
for example, in Van Craenenbroeck, et al., Eur. J. Biochem. 267, 5665-5678 (2000)
and references therein, all of which are incorporated herein by reference. 
[00152] The invention further provides cell lines comprising the transfer plasmids 
described above, i.e., cell lines in which the transfer plasmids are stably maintained as 
episomes. In particular, the invention provides producer cell lines (cell lines that
produce the proteins needed for production of infectious lentiviral particles) in which
the transfer plasmids are stably maintained as episomes. According to certain 
embodiments of the invention these cell lines constitutively produce lentiviral 
particles. 

[00153] According to other embodiments of the invention one or more of the
necessary viral proteins is under the control of an inducible promoter. Thus the 
invention provides helper cell lines in which the transfer plasmids are stably 
expressed as episomes, wherein at least one viral protein expressed by the cell line is 
under control of an inducible promoter. This allows the cells to be expanded under
conditions that are not permissive for viral production. Once the cells have reached a
desired density (e.g., confluence), or a desired cell number, etc., the protein whose 
expression is under control of the inducible promoter can be induced, allowing 
production of viral particles to begin. This system offers a number of advantages. In 
particular, since every cell has the required components, titer is increased. In
addition, it avoids the necessity of performing a transfection each time a particular
virus is desired. Any of a variety of inducible promoters known in the art may be 
used. One of ordinary skill in the art will readily be able to select an appropriate 
inducible promoter and apply appropriate techniques to induce expression therefrom. 
[00154] The invention thus provides a method of producing lentiviral particles
comprising introducing a lentiviral transfer plasmid of the invention, which lentiviral
transfer plasmid comprises a genetic element (e.g., a viral element) sufficient for 
stable maintenance of the transfer plasmid as an episome in mammalian cells, into a 
helper cell that produces proteins needed for production of infectious lentiviral
particles; and culturing the cell for a period sufficient to allow production of lentiviral
particles. The invention further provides a method of producing lentiviral particles 
comprising introducing a lentiviral transfer plasmid of the invention, which lentiviral 
transfer plasmid comprises a genetic element sufficient for stable maintenance of the
transfer plasmid as an episome in mammalian cells, into a helper cell that expresses a 
protein required for production of lentiviral particles, wherein expression of the 
protein is under control of an inducible promoter; inducing expression of the protein 
required for production of lentiviral particles; and culturing the cell for a period
sufficient to allow production of lentiviral particles.

[00155] Transgenic and knockout animals. The transfer plasmids may be used to
generate stable transgenic or knockout animals, wherein the transgene is a 
heterologous nucleic acid contained in the transfer plasmid. Transgenic animals may 
be generated through standard (non-viral) means such as pronuclear injection of the
transfer plasmid. In addition, the lentiviral particles may be used to create transgenic
animals wherein the transgene is a heterologous nucleic acid contained in the 
lentiviral particle. For example, lentiviral particles of the invention may be injected 
into the perivitelline space of single-cell embryos, which may then be implanted and 
carried to term. Alternatively, the zona pellucida may be removed and the denuded
embryo incubated with lentiviral suspension prior to implantation as described in
reference 24. This approach offers a more efficient method of creating a variety of 
transgenic animals, e.g., birds, rats, and other mammals. As used herein, a 
"transgenic animal" is a non-human animal, preferably a mammal, more preferably a 
rodent such as a rat or mouse, in which one or more of the cells of the animal includes
a transgene. Other examples of transgenic animals include non-human primates,
sheep, dogs, cows, goats, chickens, amphibians, and the like. Transgenic animals 
typically carry a gene which has been introduced into the germline of the animal, or 
an ancestor of the animal, at an early (usually one-cell) developmental stage. In 
general, a transgene is heterologous DNA, which preferably is integrated into or
occurs in the genome of the cells of a transgenic animal. Integration of the transgene
may lead to a deletion of endogenous chromosomal DNA, e.g., by homologous 
recombination, such that the function of an expression product of the DNA is 
impaired or eliminated. In this case the resulting animal is referred to as a knockdown
or knockout animal. Note that transgene sequences may include endogenous 
sequences but typically also include additional sequences that do not naturally occur 
in the animal. 

[00156] As described in Example 7, the inventors have generated transgenic mice
using a lentiviral particle comprising a heterologous nucleic acid encoding the 
fluorescent protein GFP, which serves as a transgene. The lentiviral particles were 
able to induce expression of GFP within embryonic stem cells (ES cells), and these 
ES cells gave rise to transgenic animals whose cells expressed GFP. These results 
demonstrate that heterologous nucleic acids contained in the lentiviral particles of the
invention are not subject to developmental silencing. 

[00157] Constitutive, conditional, reversible, and tissue-specific expression. The 
transfer plasmids and lentiviral particles of the invention may be used to achieve 
constitutive, conditional, reversible, or tissue-specific expression in cells, tissues, or
organisms, including transgenic animals. The invention provides a method of
reversibly expressing a transcript in a cell comprising: (i) delivering a lentiviral vector
to the cell, wherein the lentiviral vector comprises a heterologous nucleic acid, and
wherein the heterologous nucleic acid is located between sites for a site-specific
recombinase; and (ii) inducing expression of the site-specific recombinase within the
cell, thereby preventing synthesis of the transcript within the cell.
According to certain embodiments of the invention the cell is a mammalian cell. 
According to certain embodiments of the invention the step of inducing the site- 
specific recombinase comprises introducing a vector encoding the site-specific 
recombinase into the cell. According to other embodiments of the invention a nucleic
acid encoding the site-specific recombinase is operably linked to an inducible
promoter, and the inducing step comprises inducing the promoter as described above.
As discussed in more detail in Example 8, the inventors have shown that introduction 
of a lentiviral particle comprising a heterologous nucleic acid encoding the 
fluorescent protein EGFP between loxP sites into cells results in expression of EGFP
within the cells. When the EGFP-expressing cells were subsequently infected with an
adenovirus containing a nucleic acid encoding Cre, thereby inducing expression of
Cre within the cells, expression of EGFP was eliminated in a significant proportion of
the cells. Thus expression of EGFP was reversible. 

[00158] In addition, the invention provides a variety of methods for achieving
conditional and/or tissue-specific expression. For example, the invention provides a
method for expressing a transcript in a mammal in a cell type or tissue-specific
manner comprising: (i) delivering a lentiviral transfer plasmid or lentiviral particle to
cells of the mammal, wherein the lentiviral transfer plasmid or lentiviral particle 
comprises a heterologous nucleic acid, and wherein the heterologous nucleic acid is 
located between sites for a site-specific recombinase; and (ii) inducing expression of
the site-specific recombinase in a subset of the cells of the mammal, thereby
preventing synthesis of the transcript within those cells. According to certain 
embodiments of the inventive methods the recombinase is Cre. According to certain 
embodiments of the invention the step of inducing the site-specific recombinase 
comprises introducing a vector encoding the site-specific recombinase into the cell.
According to other embodiments of the invention a nucleic acid encoding the
site-specific recombinase is operably linked to an inducible promoter, and the inducing
step comprises inducing the promoter as described above. In certain embodiments of 
the invention the nucleic acid encoding the site-specific recombinase is operably 
linked to a cell type or tissue-specific promoter, so that synthesis of the recombinase
takes place only in cells or tissues in which that promoter is active.

[00159] Gene and transcript silencing. As described in more detail below, the
invention provides methods of reducing or inhibiting the expression of target genes 
and/or transcripts (which need not necessarily encode proteins) by exploiting the 
phenomenon of RNA interference (RNAi). For example, the invention provides a
method of inhibiting or reducing the expression of a target transcript in a cell
comprising delivering a lentiviral vector (e.g., a lentiviral transfer plasmid or lentiviral
particle) to the cell, wherein presence of the lentiviral vector within a cell results in 
synthesis of one or more RNAs that self-hybridize or hybridize with each other to 
form a short hairpin RNA or short interfering RNA that is targeted to the target
transcript. Such lentiviral expression vectors may be used therapeutically to silence
disease-causing genes and/or render cells resistant to infectious organisms. In
addition, lentiviral expression vectors may facilitate the creation of animals deficient
in immunogenic xenoantigens as sources of organs for organ transplantation. It will
be appreciated that in those embodiments of the invention in which the nucleic acid 
segment that provides a template for synthesis of the one or more RNAs that self- 
hybridize or hybridize with each other to form an shRNA or siRNA is floxed, 
inhibition of the target transcript may be reversed by expressing Cre, thereby excising
the template for the siRNA or shRNA. Thus the invention allows conditional and 
tissue-specific expression of target transcripts in cells, tissues, or organisms. RNAi 
and methods of using the plasmids and expression systems of the invention for 
achieving RNAi are described below. 

[00160] RNA interference

[00161] Small inhibitory RNAs were first discovered in studies of the phenomenon 
of RNA interference (RNAi) in Drosophila, as described, for example, in WO 
01/75164, etc. It was found that, in Drosophila, long double-stranded RNAs are 
processed by an RNase III-like enzyme called DICER (Bernstein et al., Nature
409:363, 2001) into smaller dsRNAs comprised of two 21 nt strands, each of which
has a 5' phosphate group and a 3' hydroxyl, and includes a 19 nt region precisely
complementary with the other strand, so that there is a 19 nt duplex region flanked by
2 nt 3' overhangs (see Figure 11). These small dsRNAs (siRNAs) act to silence
expression of any gene that includes a region complementary to one of the dsRNA
strands, presumably because a helicase activity unwinds the 19 bp duplex in the
siRNA, allowing an alternative duplex to form between one strand of the siRNA and
the target transcript. This new duplex then guides an endonuclease complex, RISC, to 
the target RNA, which it cleaves ("slices") at a single location, producing unprotected 
RNA ends that are promptly degraded by cellular machinery (see Figure 12). 

[00162] Homologs of the DICER enzyme are found in diverse species ranging
from C. elegans to humans (Sharp, Genes Dev. 15:485, 2001; Zamore, Nat. Struct.
Biol. 8:746, 2001), raising the possibility that an RNAi-like mechanism might be able 
to silence gene expression in a variety of different cell types including mammalian, or 
even human, cells. However, long dsRNAs (e.g., dsRNAs having a double-stranded
region longer than about 30-50 nucleotides) are known to activate the interferon
response in mammalian cells. Thus, rather than achieving the specific gene silencing
observed with the Drosophila RNAi mechanism, the introduction of long dsRNAs into
mammalian cells would be expected to lead to interferon-mediated non-specific
suppression of translation, potentially resulting in cell death. Long dsRNAs are 
therefore not thought to be useful for inhibiting expression of particular genes in 
mammalian cells. 

[00163] In contrast, siRNAs, when present in mammalian cells, can effectively
reduce the expression of target transcripts and genes in a specific manner without 
activating the anti-viral response (9, 10). Preferred siRNAs typically include a
base-paired region approximately 19 nt long, and may further comprise one or more
single-stranded regions, typically 3' overhangs on one or both strands. Figures 11 and 13
present various structures that can be utilized to mediate RNA interference. Figure
11 shows the siRNA structure found to be active in the Drosophila system and likely
represents the species that is active in mammalian cells. This structure consists of two
21 nt strands having a complementary core region of 19 nt and 2 nt 3' overhangs at 
each end of the double-stranded region. Figures 13A, 13B, 13C, and 13D represent
additional structures that may be used to mediate RNA interference. These hairpin
(stem-loop) structures may function directly as inhibitory RNAs or may be processed 
intracellularly to yield an siRNA structure such as that depicted in Figure 11. 
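
The siRNA geometry described above (two 21 nt strands, a 19 nt complementary core, and a 2 nt 3' overhang on each strand) can be checked mechanically. The following Python sketch is illustrative only and is not part of the disclosed methods; the function names and the example sequence are hypothetical, and only strict Watson-Crick pairing is assumed (no G-U wobble pairs).

```python
# Watson-Crick partners for RNA bases (assumption: wobble pairs are not allowed).
RNA_COMPLEMENT = str.maketrans("ACGU", "UGCA")


def rna_reverse_complement(rna):
    """Return the reverse complement of an RNA strand written 5'->3'."""
    return rna.upper().translate(RNA_COMPLEMENT)[::-1]


def describe_sirna(sense, antisense, core_len=19):
    """Check that two strands (both written 5'->3') form a core duplex.

    The first `core_len` nt of each strand must pair antiparallel with the
    other strand; whatever follows on each strand is its 3' overhang.
    Raises ValueError if the strands do not fit this layout.
    """
    sense, antisense = sense.upper(), antisense.upper()
    if len(sense) < core_len or len(antisense) < core_len:
        raise ValueError("strands are shorter than the requested core")
    if rna_reverse_complement(sense[:core_len]) != antisense[:core_len]:
        raise ValueError("core regions are not perfectly complementary")
    return {
        "duplex_bp": core_len,
        "sense_3p_overhang": sense[core_len:],
        "antisense_3p_overhang": antisense[core_len:],
    }


# Hypothetical 19 nt core with UU overhangs on both strands.
core = "GCUAGCAUCGAUCGUAGCU"
example = describe_sirna(core + "UU", rna_reverse_complement(core) + "UU")
```
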
[00164] Many different RNA species having structures such as these have been 
introduced into mammalian cells and have been shown to reduce expression of target
transcripts. For example, siRNAs targeted to transcripts encoding the HIV Gag
protein or the HIV-1 cellular receptor CD4 reduced the level of the corresponding
mRNAs and proteins in cells infected with HIV (Novina, C., et al., Nat. Med.
8(7):681-6, 2002), resulting in inhibition of virus production. Studies such as this,
demonstrating siRNA-mediated inhibition of cellular genes as well as genes of
infectious organisms, demonstrate the therapeutic potential of RNA interference for a
wide variety of conditions. In addition, the ability to selectively reduce or eliminate 
expression of particular genes has profound implications for the study of gene 
function. 

[00165] In general, preferred siRNAs reduce the target transcript level or level of 
the encoded protein at least about 2 fold, preferably at least about 5 fold, more
preferably at least about 10 fold, at least about 25 fold, at least about 50 fold or to an
even greater degree relative to the level that would be present in the absence of the
inhibitory RNA. In selecting a target sequence for any particular transcript it may be
desirable to test a variety of siRNAs in order to identify one with an appropriate 
efficacy. 

[00166] In general, an siRNA includes a double-stranded region (the "inhibitory 
region"), one strand of which is substantially complementary to a portion of the target
transcript, so that a precise hybrid can form in vivo between one strand of the siRNA 
and the target transcript. The portion of the target transcript to which the siRNA 
strand hybridizes may be referred to as the target or targeted portion or site. In certain 
preferred embodiments of the invention, the relevant inhibitory region of the siRNA is
perfectly complementary with the target transcript; in other embodiments, one or
more non-complementary residues are located at or near the ends of the
siRNA/template duplex or elsewhere. As will be appreciated by those of ordinary
skill in the art, it is generally preferred that mismatches in the central portion of the
siRNA/template duplex be avoided (see, for example, Elbashir et al., EMBO J.
20:6877, 2001).

[00167] Generally any portion of a target transcript may be selected as the target 
site, to which the antisense strand of the siRNA will be complementary. It may be 
preferable to select siRNAs that hybridize with a target site that includes exonic 
sequences in the target transcript or hybridizes exclusively with exonic sequences.
Hybridization with intronic sequences is not excluded, but generally appears not to be
preferred in mammalian cells. An siRNA that hybridizes with a target site that 
includes only sequences within a single exon may be selected, or the target site may 
be created by splicing or other modification of a primary transcript. Any site that is 
available for hybridization with an siRNA antisense strand, resulting in slicing and
degradation of the transcript, may be utilized in accordance with the present invention.
Nonetheless, those of ordinary skill in the art will appreciate that, in some instances, it 
may be desirable to select particular regions of target gene transcript as siRNA 
hybridization targets. For example, it may be desirable to avoid (i) sections of target
transcript that may be shared with other transcripts whose degradation is not desired; or
(ii) sections of target transcript that are identical or homologous to other transcripts
whose degradation is not desired. In general, coding regions and regions closer to the
3' end of the transcript than to the 5' end are preferred. The 3' portion of target
transcripts may be less likely to exhibit secondary structure that may inhibit or
interfere with siRNA activity, e.g., by reducing accessibility. 

[00168] In general, preferred siRNA sequences have a GC content between 30 and 
70% or, preferably, between 40 and 60%. In general, it is preferred to avoid target 
sequences that contain strings of >2 identical nucleotides (e.g., AAA, GGGG).
siRNA sequences may conveniently be identified by scanning the cDNA sequence
from 5' to 3' until an appropriate 19 nucleotide target is identified. If it is desired to
include a 3' overhang in the antisense strand, the 19 nt sequence should be preceded
by nucleotides complementary to the desired 3' overhang. For example, according to
certain embodiments of the invention an siRNA sequence should correspond to:
AAN19.
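
The selection criteria in this paragraph (an AAN19 site, GC content between 30 and 70%, and no run of more than two identical nucleotides) lend themselves to a simple scan. The Python sketch below is one possible reading of those criteria, offered as an illustration rather than as the procedure actually used; the function names are hypothetical, the filters are applied only to the 19 nt core, and the example cDNA is invented.

```python
import re


def gc_fraction(seq):
    """Fraction of G and C residues in a nucleotide sequence."""
    seq = seq.upper()
    return (seq.count("G") + seq.count("C")) / len(seq)


def has_long_run(seq, max_run=2):
    """True if any nucleotide repeats more than `max_run` times in a row."""
    return re.search(r"(.)\1{%d,}" % max_run, seq.upper()) is not None


def find_aan19_targets(cdna, gc_min=0.30, gc_max=0.70):
    """Scan a cDNA 5'->3' and return (core_start, 19 nt core) for each AAN19 site.

    Assumption: the GC and run filters apply to the 19 nt core only,
    not to the leading AA dinucleotide.
    """
    cdna = cdna.upper().replace("U", "T")
    hits = []
    for i in range(len(cdna) - 20):
        window = cdna[i:i + 21]
        if not window.startswith("AA"):
            continue
        core = window[2:]
        if set(core) - set("ACGT"):
            continue  # skip ambiguous bases
        if not gc_min <= gc_fraction(core) <= gc_max:
            continue
        if has_long_run(core):
            continue
        hits.append((i + 2, core))
    return hits


# Invented example sequence containing a single qualifying site.
candidates = find_aan19_targets("GGAAGCTAGCATCGATCGTAGCTGG")
```

Passing `gc_min=0.40, gc_max=0.60` applies the narrower preferred GC range stated above.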

[00169] Certain siRNAs hybridize to a target site that includes or consists entirely 
of 3' UTR sequences. Such siRNAs may tolerate a larger number of mismatches in
the siRNA/template duplex, and particularly may tolerate mismatches within the
central region of the duplex. In fact, some mismatches may be desirable as
siRNA/template duplex formation in the 3' UTR may inhibit expression of a protein
encoded by the template transcript by a mechanism related to but distinct from classic 
RNA inhibition. In particular, there is evidence to suggest that siRNAs that bind to 
the 3' UTR of a template transcript may reduce translation of the transcript rather than
decreasing its stability. Specifically, as shown in Figure 14, the DICER enzyme that
generates siRNAs in the Drosophila system discussed above and also in a variety of
organisms, is known to also be able to process a small temporal RNA (stRNA)
substrate into an inhibitory agent that, when bound within the 3' UTR of a target
transcript, blocks translation of the transcript (see Grishok, A., et al., Cell 106, 23-24,
2001; Hutvagner, G., et al., Science, 293, 834-838, 2001; Ketting, R., et al., Genes
Dev., 15, 2654-2659). For the purposes of the present invention, any partly or fully
double-stranded short RNA as described herein, one strand of which binds to a target 
transcript and reduces its expression (i.e., reduces the level of the transcript and/or 
reduces synthesis of the polypeptide encoded by the transcript) is considered to be an
siRNA, regardless of whether the RNA acts by triggering degradation, by inhibiting
translation, or by other means. In certain preferred embodiments of the invention,
reducing expression of the transcript involves degradation of the transcript. In
addition, any precursor structure (e.g., a short hairpin RNA, as described herein) that
may be processed in vivo (i.e., within a cell or organism) to generate such an siRNA is 
useful in the practice of the present invention. 

[00170] Use of RNAi in mammalian cells, tissues, and organisms is currently 
restricted by the limited delivery methods available. siRNAs can be delivered to cells
by various means, such as electroporation (11), use of lipofectants (10), or expression
of short hairpin RNAs (shRNAs) in cells from a plasmid template (11-16). shRNAs 
are precursors of siRNAs, and typically comprise dsRNA stretches of at least 19 bp 
separated by a loop of several non-self-complementary nucleotides. shRNAs adopt
stem-loop structures, thought to be recognized and processed into siRNAs by the
conserved cellular RNAi machinery (17; Ketting, R., et al., Genes Dev., 15,
2654-2659). While the approaches mentioned above have been successful at targeting gene
expression in cell culture systems, in general they are not as readily applicable to 
primary cells, which are difficult to transfect by standard methods such as those
mentioned above. Their use to target gene expression in mammalian subjects is also
problematic. A further limitation of introducing siRNAs into cells by standard means 
is that the inhibitory (knockdown) effect is transient, as mammalian cells appear to 
lack the siRNA amplification mechanisms that confer RNAi potency and longevity in 
lower organisms (10). 

[00171] The present invention encompasses the recognition that use of lentiviral
expression systems offers a means of overcoming problems associated with delivery of
siRNAs into mammalian cells and tissues, including primary mammalian cells and 
tissues, nondividing cells (including neurons and naive T cells), and cells at early 
stages of development such as embryonic cells (including embryonic stem cells). The
invention further encompasses the recognition that use of lentiviral vectors offers a
means of overcoming problems associated with delivery of siRNAs into mammalian 
subjects. 

[00172] The invention provides lentiviral vectors and expression systems capable 
of directing transcription of RNAs that hybridize to form shRNAs and/or siRNAs in 
mammalian cells. In particular, the invention provides a lentiviral vector comprising
a nucleic acid segment operably linked to a promoter, so that transcription from the
promoter results in synthesis of one or more RNAs that self-hybridize or hybridize
with each other to form an shRNA or siRNA targeted to a target transcript.
[00173] The invention further provides a three-plasmid lentiviral expression system
comprising (i) a lentiviral transfer plasmid comprising a nucleic acid segment 
operably linked to a promoter, so that transcription from the promoter results in
synthesis of one or more RNAs that self-hybridize or hybridize with each other to 
form an shRNA or siRNA targeted to a target transcript; (ii) a packaging plasmid; and 
(iii) an Env-coding plasmid. In addition, the invention provides an infectious 
lentiviral particle comprising a nucleic acid segment operably linked to a promoter, so
that transcription from the promoter results in synthesis of one or more RNAs that
self-hybridize or hybridize with each other to form an shRNA or siRNA targeted to a 
target transcript. In other words, the nucleic acid segment(s) provides template(s) for 
synthesis of an RNA that self-hybridizes to form an shRNA or for synthesis of two 
complementary RNAs that hybridize to form an siRNA. 

[00174] According to certain embodiments of the invention the lentiviral vector
comprises a nucleic acid segment which, when transcribed, produces an RNA that 
comprises two complementary elements that hybridize to one another to form a stem 
and a loop. The stem-loop structure is also referred to as a hairpin. Figure 15A 
schematically depicts such a nucleic acid segment 10 operably linked to a promoter
element 20. Nucleic acid segment 10 comprises complementary elements 30 and 40,
separated by element 50. Preferably the nucleic acid includes a transcriptional 
terminator element 60, e.g., a terminator for RNA polymerase III such as a string of T 
residues. However, such a terminator element may also be provided within a vector 
into which the nucleic acid segment is inserted. Figure 15B schematically depicts an
RNA 70 transcribed from nucleic acid segment 10 prior to hybridization. RNA 70
comprises self-complementary elements 80 and 90. 

[00175] Figure 15C schematically depicts the RNA following hybridization of the 
complementary portions, resulting in formation of stem 100 and loop 110. 
Termination within the terminator sequence results in a 3' overhang 120, which may 
comprise one or more U residues. Preferably, the stem is approximately 19 bp long,
the loop is about 1-20, more preferably about 4-12, and most preferably about 6-10
nt long, and/or the overhang is about 1-20, and more preferably about 2-6 nt long. In
certain preferred embodiments of the invention the overhang is 2 nt long. In certain
embodiments of the invention the stem is minimally 19 nucleotides in length and may 
be up to approximately 29 nucleotides in length. One of ordinary skill in the art will 
appreciate that loops of 4 nucleotides or greater are less likely to be subject to steric 
constraints than are shorter loops and therefore may be preferred.

[00176] Figure 17A schematically depicts the sequence of a nucleic acid 
comprising a segment which, when transcribed, produces an RNA that comprises two 
complementary elements that hybridize to one another to form a stem and a loop, 
inserted into the MCS of a lentiviral transfer plasmid of the invention.
Complementary portions are indicated with arrows in opposite orientation to one
another. Figure 17B depicts a nucleic acid which, when transcribed, results in an 
RNA targeted to the CD8 molecule. Figure 17C depicts the shRNA that results 
following hybridization of the complementary portions of an RNA transcribed from 
the nucleic acid in Figure 17B. The RNA forms a stem-loop structure in which the
stem is targeted to CD8. As described in more detail in Example 3, the inventors have
shown that lentiviral transfer plasmids comprising a heterologous nucleic acid whose 
sequence includes the CD8 stem-loop sequence inhibit expression of CD8 when 
introduced into cells. In addition, as described in Examples 4 and 5, lentiviral 
particles comprising a heterologous nucleic acid whose sequence includes the CD8
stem-loop sequence inhibit expression of CD8 at both the mRNA and protein level
when introduced into cells. Furthermore, the inhibition of expression persisted over 
the length of the experiment (1 month), demonstrating that RNAi mediated by the 
integrated lentivirus was stable. The inventors were unable to detect shRNA 
structures in the infected cells but were able to detect approximately
21 nucleotide-long RNAs comprising the CD8 stem-loop sequence and having a typical siRNA
structure. While not wishing to be bound by any theory, this result confirms the
hypothesis that shRNAs are processed into siRNAs within the cell. The inventors
also demonstrated that shRNA-mediated inhibition of CD8 was specific. In
particular, shRNAs targeted to the mouse CD8 RNA did not inhibit expression of
human CD8.

[00177] The invention therefore provides a lentiviral vector, e.g., a lentiviral 
transfer plasmid or lentiviral particle comprising a nucleic acid segment that provides 
a template for synthesis of one or more RNAs that self-hybridize or hybridize with
each other to form an shRNA or siRNA, wherein the shRNA or siRNA is targeted to a 
target transcript and reduces expression of the transcript. For example, the invention 
provides a lentiviral transfer plasmid comprising the following elements: a nucleic 
acid including (i) a functional packaging signal; (ii) a multiple cloning site (MCS)
into which a nucleic acid may be inserted; (iii) at least one additional element selected 
from the group consisting of: a second MCS, an HIV FLAP element, a heterologous 
promoter, a heterologous enhancer, an expression-enhancing posttranscriptional 
regulatory element, a target site for a site-specific recombinase, and a self-inactivating
(SIN) LTR; and (iv) a nucleic acid segment that provides a template for synthesis of
an shRNA or siRNA, which shRNA or siRNA is targeted to a target transcript and 
reduces expression of the transcript. In certain preferred embodiments of the 
invention the nucleic acid segment provides a template for synthesis of an RNA that 
self-hybridizes to form an shRNA. 

[00178] Any of the various embodiments of the elements included in the lentiviral
transfer plasmid or lentiviral particle may be selected as described above. In 
particular, the invention provides a lentiviral transfer plasmid comprising the 
following elements: a nucleic acid including (i) a functional packaging signal; (ii) a 
multiple cloning site (MCS) into which a nucleic acid may be inserted; (iii) a second
MCS; (iv) an HIV FLAP element; (v) a WRE; (vi) two loxP sites; (vii) a
self-inactivating (SIN) LTR; and (viii) a nucleic acid segment operably linked to a Pol III
promoter, wherein the nucleic acid segment provides a template for synthesis of one 
or more RNAs that self-hybridize or hybridize with each other to form an shRNA or 
siRNA, which shRNA or siRNA is targeted to a target transcript and reduces
expression of the transcript. In certain preferred embodiments of the invention the
nucleic acid segment provides a template for synthesis of an RNA that hybridizes to 
form an shRNA. 

[00179] Identification of sequences for design of the stem portion of an shRNA 
may be performed as described above for siRNAs. See also the Web sites having
URLs
www.mpibpc.gwdg.de/abteilungen/100/105/sirna.html and
katahdin.chsl.org:9331/RNAi (visited October 23, 2002). The first step is to search
for potential target sequences, e.g., by scanning the cDNA. According to certain
embodiments of the invention a potential target sequence corresponds to any sequence
of the form GN18. According to other embodiments of the invention a potential target
sequence corresponds to a sequence of the form AAGN18. According to yet other
embodiments of the invention a potential target sequence corresponds to a sequence
of the form AAGN18TT. Once a potential target is selected, the sequence GN18 is
used as the sequence for the stem (duplex) portion of the shRNA. Thus in certain
embodiments of the invention the GN18 is preferably flanked by AA and TT in
the context of the mRNA. Where the U6 promoter is used, a 5' guanine is generally

required due to the constraints of this promoter. It may be useful to test 4-5 targets for
each transcript or gene of interest. It may be desirable to perform a database search
(e.g., BLAST search) using the GN18 sequences to verify that the sequence is unique
in order to avoid silencing other genes in addition to the target gene. As lentivirus
pseudotyped with VSV-G is capable of infecting human cells, if the lentivirus is not
intended for use in humans it may be desirable to determine if there are human genes
that may be silenced. If so, it may be preferable to avoid sequences that would target 
such genes. 
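
The three target forms named in this paragraph (GN18, AAGN18, and AAGN18TT) can each be written as a regular expression. The Python sketch below is illustrative and not part of the disclosure; the names `PATTERNS` and `find_stem_candidates` are hypothetical, and a zero-width lookahead is used so that overlapping candidates are all reported. Uniqueness checking (e.g., by BLAST) is outside the scope of the sketch.

```python
import re

# Each pattern captures only the 19 nt GN18 stem; flanking AA/TT are context.
PATTERNS = {
    "GN18": r"(?=(G[ACGT]{18}))",
    "AAGN18": r"(?=AA(G[ACGT]{18}))",
    "AAGN18TT": r"(?=AA(G[ACGT]{18})TT)",
}


def find_stem_candidates(cdna, form="AAGN18TT"):
    """Return (stem_start, stem) for every site in `cdna` matching `form`.

    `cdna` is read 5'->3'; RNA input is accepted and converted to DNA letters.
    """
    cdna = cdna.upper().replace("U", "T")
    return [(m.start(1), m.group(1)) for m in re.finditer(PATTERNS[form], cdna)]


# Invented example: one G-initiated 19-mer flanked by AA and TT.
example_cdna = "CCAAGACTCATCACTCATCACTCTTCC"
strict_hits = find_stem_candidates(example_cdna, "AAGN18TT")
loose_hits = find_stem_candidates(example_cdna, "GN18")
```

The stricter forms return a subset of the GN18 hits, so in practice one might start with AAGN18TT and relax the form only if too few candidates are found.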

[00180] According to certain embodiments of the invention the sequence 
TTCAAGAGA (SEQ ID NO:10) is selected for the loop. Thus to design the
complete hairpin sequence according to certain embodiments of the invention, a 19 nt
sequence suitable as the inhibitory portion of a typical siRNA is selected, optionally
including an additional two nucleotides such as AA at the 5' end in order to generate a
3' UU overhang in the resulting shRNA. A loop sequence is added at the 3' end of
the 19 (or 21) nt sequence, followed by a sequence complementary to the 19 (or 21) nt
sequence, resulting in a stem-loop after hybridization. See Example 3 for additional
information. Any of a variety of other sequences may be selected for the loop 
including, but not limited to, loops used in the shRNAs described in Brummelkamp, 
et al., Paddison, et al., Sui, et al., Yui, et al., or Paul, et al.
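
The hairpin assembly described in this paragraph (stem sequence, then loop, then the reverse complement of the stem) can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the construct of Example 3: the function names and the example stem are hypothetical, no promoter, terminator, or cloning overhangs are added, and only the loop TTCAAGAGA (SEQ ID NO:10) is taken from the text.

```python
DNA_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5'->3'."""
    return dna.upper().translate(DNA_COMPLEMENT)[::-1]


def hairpin_template(stem, loop="TTCAAGAGA"):
    """Build the sense-strand DNA template: stem + loop + reverse complement of stem.

    `stem` is the 19 nt target sequence, or 21 nt when AA is prepended so
    that the transcript ends in UU.
    """
    stem = stem.upper()
    if len(stem) not in (19, 21) or set(stem) - set("ACGT"):
        raise ValueError("stem must be 19 or 21 nt of A, C, G, T")
    return stem + loop + reverse_complement(stem)


def to_rna(dna):
    """Write a DNA sense-strand sequence as the RNA transcribed from it."""
    return dna.upper().replace("T", "U")


# Invented 21 nt stem (AA + 19 nt); the resulting transcript ends in UU.
template = hairpin_template("AAGCTAGCATCGATCGTAGCT")
transcript = to_rna(template)
```

Because the reverse complement of a stem beginning with AA ends in TT, the transcript carries the 3' UU described above without any additional step.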

[00181] The invention provides a method of reducing or inhibiting expression of a 
target transcript in a cell comprising: (i) delivering a lentiviral vector to the cell,
wherein presence of the lentiviral vector within the cell results in transcription of one
or more RNAs that self-hybridize or hybridize with each other to form an shRNA or
siRNA that is targeted to the target transcript and reduces expression of the transcript.
The lentiviral vector comprises a nucleic acid segment that provides a template for 
synthesis of one or more RNAs that self-hybridize or hybridize with each other to 
form an shRNA or siRNA that is targeted to the transcript and reduces expression of 
5 the transcript. In certain preferred embodiments of the invention the nucleic acid 
provides a template for synthesis of an RNA that hybridizes to form an shRNA. 
[00182] According to certain embodiments of the invention the cell is a 
mammalian cell. Any of the lentiviral transfer plasmids or lentiviral particles 
described above may be used, wherein presence of the plasmid or particle in a cell 
provides a template for synthesis of one or more RNAs that self-hybridize or 
hybridize with each other to form an shRNA or siRNA targeted to the transcript of 
interest. According to certain embodiments of the invention the delivering step 
comprises delivering the lentiviral transfer plasmid or lentiviral particle to a 
mammalian subject, thereby delivering the lentiviral transfer plasmid or lentiviral 
particle to a cell that is present within the body of the subject. According to certain 
embodiments of the invention the cell is a primary cell. By "primary cell" is meant a 
cell that has been removed from the body of a subject and maintained in tissue culture 
for less than approximately 1, 2, 3, 4, or 5 doubling periods or a non-immortalized 
cell. According to certain embodiments of the invention the mammalian cell is a 
nondividing cell, e.g., a terminally differentiated T cell, neuron, hepatocyte, retinal 
cell, skeletal myocyte, cardiac myocyte, keratinocyte, macrophage, etc. The 
mammalian cell may be a human cell or a nonhuman (e.g., mouse or rat) cell. 
According to certain embodiments of the invention the mammalian cell is an 
embryonic cell or an embryonic stem cell. 

[00183] The invention further provides a method for reversibly inhibiting or 
reducing expression of a target transcript in a cell comprising: (i) delivering a lentiviral 
vector to the cell, wherein the lentiviral vector comprises a nucleic acid segment that 
provides a template for synthesis of one or more RNAs that self-hybridize or 
hybridize with each other to form an shRNA or siRNA, which shRNA or siRNA is 
targeted to the target transcript and reduces expression of the transcript, wherein the 
nucleic acid segment is located between sites for a site-specific recombinase; and (ii) 
inducing expression of the site-specific recombinase within the cell, thereby 
preventing synthesis of at least one of the RNAs. According to certain embodiments 
of the invention the cell is a mammalian cell. According to certain embodiments of 
the invention the recombinase is Cre. According to certain embodiments of the 
invention the step of inducing the site-specific recombinase comprises introducing a 
vector encoding the site-specific recombinase into the cell. According to other 
embodiments of the invention a nucleic acid encoding the site-specific recombinase is 
operably linked to an inducible promoter, and the inducing step comprises inducing 
the promoter within a cell containing the nucleic acid, as described above. 
[00184] In addition, the invention provides a variety of methods for reversibly 
inhibiting or reducing expression of a target transcript in a conditional and/or 
tissue-specific manner. For example, the invention provides a method for reversibly 
inhibiting or reducing expression of a transcript in a mammal in a cell type specific or 
tissue-specific manner comprising: (i) delivering a lentiviral vector to cells of the 
mammal, wherein the lentiviral vector comprises a nucleic acid segment that provides 
a template for synthesis of one or more RNAs that self-hybridize or hybridize with 
each other to form an shRNA or siRNA, which shRNA or siRNA is targeted to the 
target transcript and reduces expression of the transcript, and wherein the nucleic acid 
segment is located between sites for a site-specific recombinase; and (ii) inducing 
expression of the site-specific recombinase in a subset of the cells of the mammal, 
thereby preventing synthesis of at least one of the RNAs within those cells. 
According to certain embodiments of the inventive methods the recombinase is Cre. 
According to certain embodiments of the invention the step of inducing the 
site-specific recombinase comprises introducing a vector encoding the site-specific 
recombinase into a subset of the cells of the subject, e.g., by utilizing a vector that 
requires a receptor present only on a subset of the cells. According to other 
embodiments of the invention a nucleic acid encoding the site-specific recombinase is 
operably linked to an inducible promoter, and the inducing step comprises inducing 
the promoter within cells containing the nucleic acid, as described above, whereby 
expression of the target transcript is restored only in cells or tissues in which the 
promoter is active. 

[00185] In certain embodiments of the invention the nucleic acid encoding the 
site-specific recombinase is operably linked to a cell type or tissue-specific promoter, so 
that synthesis of the recombinase takes place only in cells or tissues in which that 
promoter is active, whereby expression of the target transcript is restored only in cells 
or tissues in which the promoter is active. 

[00186] In certain preferred embodiments of the invention, the promoter utilized to 
direct expression of the one or more RNAs that self-hybridize or hybridize with each 
other to form an shRNA or siRNA is a promoter for RNA polymerase III (Pol III). 
Pol III directs synthesis of small transcripts that terminate within a stretch of 4-5 T 
residues. Certain Pol III promoters such as the U6 or H1 promoters do not require 
exacting regulatory elements (other than the first transcribed nucleotide) within the 
transcribed region and thus are preferred according to certain embodiments of the 
invention since they readily permit the selection of desired RNA sequences. In the 
case of naturally occurring U6 promoters the first transcribed nucleotide is guanosine, 
while in the case of naturally occurring H1 promoters the first transcribed nucleotide 
is adenine. (See, e.g., Medina MF and Joshi S., Curr Opin Mol Ther 1999 
Oct;1(5):580-94; Yu, J., et al., Proc. Natl. Acad. Sci., 99(9), 6047-6052 (2002); Sui, 
G., et al., Proc. Natl. Acad. Sci., 99(8), 5515-5520 (2002); Paddison, P., et al., Genes 
and Dev., 16, 948-958 (2002); Brummelkamp, T., et al., Science, 296, 550-553 
(2002); Miyagishi, M. and Taira, K., Nat. Biotech., 20, 497-500 (2002); Paul, C., et 
al., Nat. Biotech., 20, 505-508 (2002); Tuschl, T., et al., Nat. Biotech., 20, 446-448 
(2002)). Thus in certain embodiments of the invention, e.g., where transcription is 
driven by a U6 promoter, the 5' nucleotide of preferred RNA sequences for formation 
of shRNAs or siRNAs is G. In certain other embodiments of the invention, e.g., 
where transcription is driven by an H1 promoter, the 5' nucleotide may be A. The 
lentiviral transfer plasmid may be created by inserting a cassette comprising the RNA 
sequence into a transfer plasmid optimized for RNAi that already contains a suitable 
promoter, e.g., a plasmid such as pLL3.7. Alternately, a cassette comprising the RNA 
sequence operably linked to a suitable promoter may be inserted into a transfer 
plasmid that lacks such a promoter, e.g., a plasmid such as pLL3.0. 
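
The Pol III constraints described above may be checked programmatically before a cassette is ordered or cloned. The following Python sketch is illustrative only and rests on stated assumptions: the function name check_pol3_template is hypothetical, the template is written as DNA, and the check for an internal run of four or more T residues is an inference from the statement that Pol III transcripts terminate within a stretch of 4-5 T residues, not a rule recited in the text.

```python
import re

# First transcribed nucleotide for naturally occurring promoters, as stated above.
EXPECTED_FIRST_NUCLEOTIDE = {"U6": "G", "H1": "A"}

def check_pol3_template(template, promoter="U6"):
    """Return a list of human-readable problems found in a Pol III template.

    An empty list means that neither check raised a concern.
    """
    if promoter not in EXPECTED_FIRST_NUCLEOTIDE:
        raise ValueError("unknown promoter: " + promoter)
    seq = template.upper()
    problems = []
    expected = EXPECTED_FIRST_NUCLEOTIDE[promoter]
    if not seq.startswith(expected):
        problems.append(
            "first nucleotide should be " + expected + " for the " + promoter + " promoter"
        )
    # Assumption: an internal run of 4 or more T residues may act as a
    # premature termination signal for Pol III.
    if re.search(r"T{4,}", seq):
        problems.append("internal run of 4 or more T residues")
    return problems

# Hypothetical templates, for illustration only.
ok = check_pol3_template("GACGTACGTACGTACGTAC", promoter="U6")
bad = check_pol3_template("AACGTTTTTACG", promoter="U6")
```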
[00187] The invention thus encompasses administration of a lentiviral vector to a 
cell, e.g., a mammalian cell, to inhibit or reduce expression of any target transcript or 
gene, wherein the lentiviral vector comprises a nucleic acid segment that provides a 
template for synthesis of one or more RNAs that self-hybridize or hybridize to form 
an shRNA or siRNA that is targeted to the target transcript or gene. In general, the 
nucleic acid segment may provide a template for synthesis of any RNA structure 
capable of being processed in vivo to an shRNA or siRNA, so long as the RNA does 
not induce other negative events such as induction of the interferon response. In 
certain preferred embodiments of the invention the nucleic acid segment provides a 
template for synthesis of an RNA that self-hybridizes to form an shRNA targeted to 
the target transcript. 

[00188] As discussed above, in addition to their use for synthesis of RNAs that 
self-hybridize to form shRNAs, the lentiviral vectors of the invention may be used for 
synthesis of various other RNAs that mediate RNAi. In particular, two separate 
approximately 21 nt RNA strands may be generated, each of which contains a 19 nt 
region complementary to the other, and the individual strands may hybridize together 
to generate an siRNA structure. Accordingly, the invention encompasses a lentiviral 
vector comprising two transcribable regions, each of which provides a template for 
synthesis of a transcript containing a region complementary with the other. Generally 
each transcript will be approximately 21 nt in length and the complementary regions 
will be approximately 19 nt in length, as described above. 
[00189] In addition, the invention provides a lentiviral vector that contains 
oppositely directed promoters flanking a nucleic acid segment and positioned so that 
two different transcripts, approximately 21 nt in length and having complementary 
regions approximately 19 nt in length, are generated. It will be appreciated that 
appropriate terminators should be supplied in these cases. In cases in which the RNA 
structure undergoes one or more processing steps, those of ordinary skill in the art 
will appreciate that the nucleic acid segment will preferably be designed to include 
sequences that may be necessary for processing of the RNA. Figure 16 presents a 
schematic diagram of such a plasmid. 

[00190] A large number of variations on the above are possible. For example, the 
lentiviral vector may comprise multiple nucleic acid segments, each of which 
provides a template for synthesis of one or more RNAs that self-hybridize or 
hybridize with each other to form shRNAs or siRNAs, which shRNAs or siRNAs may 
target the same transcript or different transcripts. In addition, according to certain 
embodiments of the invention the nucleic acid segment provides a template for 
synthesis of a plurality of RNAs that self-hybridize or hybridize with each other to 
form a plurality of siRNAs or siRNA precursors. For example, a single promoter may 
direct synthesis of a single RNA transcript containing multiple self-complementary 
regions, each of which may hybridize to generate a plurality of stem-loop structures. 
These structures may be cleaved in vivo, e.g., by DICER, to generate multiple 
different siRNAs. It will be appreciated that such transcripts preferably contain a 
termination signal at the 3' end of the transcript but not between the individual siRNA 
units. 

[00191] The present invention encompasses any cell manipulated to contain an 
inventive lentiviral transfer plasmid, lentiviral particle, or lentiviral genome derived 
therefrom (e.g., a provirus), wherein the lentiviral transfer plasmid, lentiviral particle, 
or lentiviral genome provides a template for synthesis of one or more RNAs that 
self-hybridize or hybridize to form an shRNA or siRNA. Preferably, the cell is a 
mammalian cell. According to certain embodiments of the invention the cell is a 
human cell. Those of ordinary skill in the art will appreciate that intracellular 
expression of RNAs that self-hybridize or hybridize with each other to form shRNAs 
or siRNAs according to the present invention may allow the production of cells that 
produce the shRNA or siRNA over long periods of time (e.g., greater than a few days, 
preferably at least several months, more preferably at least a year or longer, possibly a 
lifetime). 

[00192] In certain embodiments of the invention, the cells are non-human cells 
within an organism. For example, the present invention encompasses transgenic 
animals the cells of which contain an inventive lentiviral transfer plasmid, lentiviral 
particle, or lentiviral genome derived therefrom, wherein the lentiviral transfer 
plasmid, lentiviral particle, or lentiviral genome provides a template for synthesis of 
one or more RNAs that self-hybridize or hybridize to form an shRNA or siRNA in 
one or more cell types or tissues of the transgenic animal. The invention therefore 
provides a transgenic animal, one or more cells of which comprise a heterologous 
nucleic acid segment provided by a lentiviral vector, wherein the lentiviral vector 
comprises (i) a functional packaging signal; (ii) a multiple cloning site (MCS); and 
(iii) at least one additional element selected from the group consisting of: a second 
MCS, a second MCS into which a heterologous promoter or promoter-enhancer is 
inserted, an HIV FLAP element, an expression-enhancing posttranscriptional 
regulatory element, a target site for a site-specific recombinase, and a self-inactivating 
(SIN) LTR. According to certain embodiments of the invention the cells of the 
transgenic animal contain a heterologous nucleic acid segment that comprises sites for 
a site-specific recombinase. 

[00193] As described in Example 7, the inventors have generated transgenic mice 
using a variety of lentiviral particles each comprising a first heterologous nucleic acid 
segment encoding the fluorescent protein GFP, and also a second heterologous 
nucleic acid segment that provides a template for synthesis of an RNA that 
self-hybridizes to form an shRNA targeted to a target transcript. The lentiviral particles 
were able to induce expression of GFP within embryonic stem cells (ES cells), and 
these ES cells gave rise to transgenic animals whose cells expressed GFP. 
Furthermore, expression of the particular target transcript corresponding to the second 
nucleic acid segment was reduced or inhibited in cells of the transgenic mice. These 
results demonstrate that the lentiviral transfer plasmids and lentiviral particles of the 
invention may be used to generate transgenic animals in which expression of a target 
transcript is reduced or inhibited. It is noted that the lentiviral vectors of the invention 
may thus generally be used as bifunctional vectors, leading both to expression of a 
heterologous nucleic acid and silencing of an endogenous gene. 

[00194] Kits 

[00195] The invention provides a variety of kits comprising one or more of the 
lentiviral transfer plasmids of the invention. For example, the invention provides a 
kit comprising (a) a lentiviral transfer plasmid comprising a nucleic acid sequence 
including (i) a functional packaging signal; (ii) a multiple cloning site (MCS) into 
which a heterologous gene may be inserted; and (iii) at least one additional element 
selected from the group consisting of: a second MCS, an HIV FLAP element, a 
heterologous promoter, a heterologous enhancer, an expression-enhancing 
posttranscriptional regulatory element, a target site for a site-specific recombinase, 
and a self-inactivating (SIN) LTR; and one or more of the following items: (i) a 
packaging mix comprising one or more plasmids that collectively provide nucleic acid 
sequences coding for retroviral or lentiviral Gag and Pol proteins and an envelope 
protein. The packaging mix may contain two or more plasmids. According to certain 
embodiments of the invention the packaging mix includes two plasmids, one of which 
provides nucleic acid sequences coding for Gag and Pol and the other of which 
provides nucleic acid segments coding for an envelope protein; (ii) cells (e.g., a cell 
line) that are permissive for production of lentiviral particles such as 293T cells; (iii) 
packaging cells, e.g., a cell line that is permissive for production of lentiviral particles 
and provides the proteins Gag, Pol, Env, and, optionally, Rev; (iv) cells suitable for 
use in titering lentiviral particles; a transfection-enhancing agent such as 
Lipofectamine; (v) a selection agent such as an antibiotic, preferably corresponding to 
an antibiotic resistance gene in the lentiviral transfer plasmid; (vi) instructions for use; 
(vii) a lentiviral transfer plasmid comprising a heterologous nucleic acid segment such 
as a reporter gene that may serve as a positive control (referred to as a "positive 
control plasmid"). 

[00196] According to certain embodiments of the invention the kit contains a set of 
lentiviral transfer plasmids comprising a variety of different heterologous promoters 
and/or reporter genes. For example, the kit may contain a set of two or more vectors 
selected from the group consisting of the plasmids of: SEQ ID NO: 2, SEQ ID NO: 3, 
SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO: 8, and 
SEQ ID NO: 9. 

[00197] Therapeutic Applications and Pharmaceutical Formulations 
[00198] The lentiviral vectors of the invention are useful for a wide variety of 
therapeutic applications. In particular, they are useful in any context for which gene 
therapy is contemplated. For example, lentiviral vectors comprising a heterologous 
nucleic acid segment operably linked to a promoter are useful for any disease or 
clinical condition associated with reduction or absence of the protein encoded by the 
heterologous nucleic acid segment, or any disease or clinical condition that can be 
effectively treated by augmenting the expression of the encoded protein within the 
subject. For example, lentiviral vectors comprising a nucleic acid segment encoding 
the cystic fibrosis transmembrane conductance regulator (CFTR) or encoding 
α1-antitrypsin may be used for the treatment of cystic fibrosis and α1-antitrypsin 
deficiency, respectively. Lentiviral vectors comprising a nucleic acid segment 
encoding Factor VIII or Factor IX may be used for treatment of hemophilia A or B, 
respectively. See the Web site having URL www.wiley.co.uk/genetherapy/clinical/ 
(visited October 19, 2002) for a representative list of current gene therapy clinical 
trials involving expression of a therapeutic protein in a subject in need of treatment. 
[00199] Inventive lentiviral vectors capable of causing intracellular synthesis of 
inhibitory RNAs (siRNAs or shRNAs) are useful in treating any disease or clinical 
condition associated with overexpression of a transcript or its encoded protein in a 
subject, or any disease or clinical condition that may be treated by causing reduction 
of a transcript or its encoded protein in a subject. For example, many cancers are 
associated with overexpression of oncogene products. Delivering a lentiviral vector 
that provides a template for synthesis of one or more RNAs that self-hybridize or 
hybridize with each other to form an shRNA or siRNA targeted to the transcript 
encoding the oncogene product may be used to treat such cancers. The high degree of 
specificity achieved by RNA interference suggests that it is possible to selectively 
target transcripts containing single base pair mutations while not interfering with 
expression of the normal cellular allele. Lentiviral vectors that provide a template for 
synthesis of one or more RNAs that self-hybridize or hybridize with each other to 
form an shRNA or siRNA targeted to a transcript encoding a cytokine may be used to 
regulate immune system responses (e.g., responses responsible for organ transplant 
rejection, allergy, autoimmune diseases, inflammation, etc.). Lentiviral vectors that 
provide a template for synthesis of one or more RNAs that self-hybridize or hybridize 
with each other to form an shRNA or siRNA targeted to a transcript of an infectious 
agent or targeted to a cellular transcript whose encoded product is necessary for or 
contributes to any aspect of the infectious process may be used in the treatment of 
infectious diseases. 

[00200] Gene therapy protocols may involve administering an effective amount of 
a lentiviral vector whose presence within a cell results in production of a therapeutic 
siRNA or shRNA to a subject either before, substantially contemporaneously with, or 
after the onset of a condition to be treated. Another approach that may be used 
alternatively or in combination with the foregoing is to isolate a population of cells, 
e.g., stem cells or immune system cells from a subject, optionally expand the cells in 
tissue culture, and administer a lentiviral vector whose presence within a cell results 
in production of a therapeutic siRNA or shRNA to the cells in vitro. The cells may 
then be returned to the subject, where, for example, they may provide a population of 
cells that produce a therapeutic shRNA or siRNA, or that are resistant to infection by 
an infectious organism, etc. Optionally, cells expressing the therapeutic shRNA or 
siRNA can be selected in vitro prior to introducing them into the subject. In some 
embodiments of the invention a population of cells, which may be cells from a cell 
line or from an individual other than the subject, can be used. Methods of isolating 
stem cells, immune system cells, etc., from a subject and returning them to the subject 
are well known in the art. Such methods are used, e.g., for bone marrow transplant, 
peripheral blood stem cell transplant, etc., in patients undergoing chemotherapy. 
[00201] Compositions comprising lentiviral vectors of the invention may provide a 
template for a single siRNA or shRNA species, targeted to a single site in a single 
target transcript, or alternatively may provide templates for a plurality of different 
siRNA or shRNA species, targeted to one or more sites in one or more target 
transcripts. In some embodiments of the invention, it will be desirable to utilize 
compositions comprising one or more lentiviral vectors, wherein presence of the 
lentiviral vector(s) within a cell, or within different cells in the body, results in 
production of a plurality of different siRNA or shRNA species targeted to different 
genes, which may be cellular genes or, where an infection is being treated, genes of an 
infectious organism. Also, some embodiments will provide templates for more than 
one siRNA or shRNA species targeted to a single transcript. To give but one 
example, it may be desirable to provide templates for synthesis of one or more RNAs 
that self-hybridize or hybridize with each other to form at least one siRNA or shRNA 
targeted to coding regions of a target transcript and at least one siRNA or shRNA 
targeted to the 3' UTR. This strategy may provide extra assurance that products 
encoded by the relevant transcript will not be generated because at least one siRNA or 
shRNA will target the transcript for degradation while at least one other inhibits the 
translation of any transcripts that avoid degradation. The invention encompasses 
"therapeutic cocktails", including approaches in which a single lentiviral particle 
provides templates for synthesis of one or more RNAs that self-hybridize or hybridize 
to form shRNAs or siRNAs that inhibit multiple target transcripts. 

[00202] It may be desirable to combine the administration of inventive lentiviral 
vectors with one or more additional therapeutic agents. The invention therefore 
encompasses compositions comprising a lentiviral vector of the invention, preferably 
a lentiviral particle, and a second therapeutic agent, e.g., a composition approved by 
the U.S. Food and Drug Administration. 

[00203] Inventive compositions may be formulated for delivery by any available 
route including, but not limited to, parenteral (e.g., intravenous), intradermal, 
subcutaneous, oral (e.g., inhalation), transdermal (topical), transmucosal, rectal, and 
vaginal. Preferred routes of delivery include parenteral, transmucosal, rectal, and 
vaginal. Inventive pharmaceutical compositions typically include a lentiviral vector 
in combination with a pharmaceutically acceptable carrier. As used herein the 
language "pharmaceutically acceptable carrier" includes solvents, dispersion media, 
coatings, antibacterial and antifungal agents, isotonic and absorption delaying agents, 
and the like, compatible with pharmaceutical administration. Supplementary active 
compounds can also be incorporated into the compositions. 
[00204] A pharmaceutical composition is formulated to be compatible with its 
intended route of administration. Solutions or suspensions used for parenteral, 
intradermal, or subcutaneous application can include the following components: a 
sterile diluent such as water for injection, saline solution, fixed oils, polyethylene 
glycols, glycerine, propylene glycol or other synthetic solvents; antibacterial agents 
such as benzyl alcohol or methyl parabens; antioxidants such as ascorbic acid or 
sodium bisulfite; chelating agents such as ethylenediaminetetraacetic acid; buffers 
such as acetates, citrates or phosphates; and agents for the adjustment of tonicity such 
as sodium chloride or dextrose. pH can be adjusted with acids or bases, such as 
hydrochloric acid or sodium hydroxide. The parenteral preparation can be enclosed 
in ampoules, disposable syringes or multiple dose vials made of glass or plastic. 
[00205] Pharmaceutical compositions suitable for injectable use typically include 
sterile aqueous solutions (where water soluble) or dispersions and sterile powders for 
the extemporaneous preparation of sterile injectable solutions or dispersions. For 
intravenous administration, suitable carriers include physiological saline, 
bacteriostatic water, Cremophor EL™ (BASF, Parsippany, NJ) or phosphate buffered 
saline (PBS). In all cases, the composition should be sterile and should be fluid to the 
extent that easy syringability exists. Preferred pharmaceutical formulations are stable 
under the conditions of manufacture and storage and must be preserved against the 
contaminating action of microorganisms such as bacteria and fungi. In general, the 
relevant carrier can be a solvent or dispersion medium containing, for example, water, 
ethanol, polyol (for example, glycerol, propylene glycol, and liquid polyethylene 
glycol, and the like), and suitable mixtures thereof. The proper fluidity can be 
maintained, for example, by the use of a coating such as lecithin, by the maintenance 
of the required particle size in the case of dispersion and by the use of surfactants. 
Prevention of the action of microorganisms can be achieved by various antibacterial 
and antifungal agents, for example, parabens, chlorobutanol, phenol, ascorbic acid, 
thimerosal, and the like. In many cases, it will be preferable to include isotonic 
agents, for example, sugars, polyalcohols such as mannitol, sorbitol, sodium chloride in 
the composition. Prolonged absorption of the injectable compositions can be brought 
about by including in the composition an agent which delays absorption, for example, 
aluminum monostearate and gelatin. 

[00206] Sterile injectable solutions can be prepared by incorporating the active 
compound in the required amount in an appropriate solvent with one or a combination 
of ingredients enumerated above, as required, followed by filtered sterilization. 
Generally, dispersions are prepared by incorporating the active compound into a 
sterile vehicle which contains a basic dispersion medium and the required other 
ingredients from those enumerated above. In the case of sterile powders for the 
preparation of sterile injectable solutions, the preferred methods of preparation are 
vacuum drying and freeze-drying which yields a powder of the active ingredient plus 
any additional desired ingredient from a previously sterile-filtered solution thereof. 
[00207] Oral compositions generally include an inert diluent or an edible carrier. 
For the purpose of oral therapeutic administration, the active compound can be 
incorporated with excipients and used in the form of tablets, troches, or capsules, e.g., 
gelatin capsules. Oral compositions can also be prepared using a fluid carrier for use 
as a mouthwash. Pharmaceutically compatible binding agents, and/or adjuvant 
materials can be included as part of the composition. The tablets, pills, capsules, 
troches and the like can contain any of the following ingredients, or compounds of a 
similar nature: a binder such as microcrystalline cellulose, gum tragacanth or gelatin; 
an excipient such as starch or lactose, a disintegrating agent such as alginic acid, 
Primogel, or corn starch; a lubricant such as magnesium stearate or Sterotes; a glidant 
such as colloidal silicon dioxide; a sweetening agent such as sucrose or saccharin; or a 
flavoring agent such as peppermint, methyl salicylate, or orange flavoring. 
Formulations for oral delivery may advantageously incorporate agents to improve 
stability within the gastrointestinal tract and/or to enhance absorption. 
[00208] For administration by inhalation, the inventive lentiviral vectors are 
preferably delivered in the form of an aerosol spray from a pressurized container or 
dispenser which contains a suitable propellant, e.g., a gas such as carbon dioxide, or a 
nebulizer. 

[00209] Systemic administration can also be by transmucosal or transdermal 
means. For transmucosal or transdermal administration, penetrants appropriate to the 
barrier to be permeated are used in the formulation. Such penetrants are generally 
known in the art, and include, for example, for transmucosal administration, 
detergents, bile salts, and fusidic acid derivatives. Transmucosal administration can 
be accomplished through the use of nasal sprays or suppositories. For transdermal 
administration, the active compounds are formulated into ointments, salves, gels, or 
creams as generally known in the art. 

[00210] The compounds can also be prepared in the form of suppositories (e.g., 
with conventional suppository bases such as cocoa butter and other glycerides) or 
retention enemas for rectal delivery. 

[00211] In one embodiment, the active agents, i.e., a lentiviral vector of the 
invention and/or other agents to be administered together with a lentiviral vector of 
the invention, are prepared with carriers that will protect the compound against rapid 
elimination from the body, such as a controlled release formulation, including 
implants and microencapsulated delivery systems. Biodegradable, biocompatible 
polymers can be used, such as ethylene vinyl acetate, polyanhydrides, polyglycolic 
acid, collagen, polyorthoesters, and polylactic acid. Methods for preparation of such 
formulations will be apparent to those skilled in the art. The materials can also be 
obtained commercially from Alza Corporation and Nova Pharmaceuticals, Inc. 
Liposomal suspensions (including liposomes targeted to infected cells with 
monoclonal antibodies to viral antigens) can also be used as pharmaceutically 
acceptable carriers. These can be prepared according to methods known to those 
skilled in the art, for example, as described in U.S. Patent No. 4,522,811. 
[00212] It is advantageous to formulate oral or parenteral compositions in dosage 
unit form for ease of administration and uniformity of dosage. Dosage unit form as 
used herein refers to physically discrete units suited as unitary dosages for the subject 
to be treated; each unit containing a predetermined quantity of active compound 
calculated to produce the desired therapeutic effect in association with the required 
pharmaceutical carrier. 

[00213] Toxicity and therapeutic efficacy of such compounds can be determined
by standard pharmaceutical procedures in cell cultures or experimental animals, e.g.,
for determining the LD50 (the dose lethal to 50% of the population) and the ED50 (the
dose therapeutically effective in 50% of the population). The dose ratio between
toxic and therapeutic effects is the therapeutic index and it can be expressed as the
ratio LD50/ED50. Compounds which exhibit high therapeutic indices are preferred.
While compounds that exhibit toxic side effects can be used, care should be taken to
design a delivery system that targets such compounds to the site of affected tissue in
order to minimize potential damage to uninfected cells and, thereby, reduce side
effects.
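
The therapeutic index defined above is a simple ratio, and a minimal sketch of the
calculation is given here. The function name and the example doses are hypothetical
and are not data from this specification; both doses are assumed to be expressed in
the same units.

```python
def therapeutic_index(ld50: float, ed50: float) -> float:
    """Return the ratio LD50 / ED50 (both doses in the same units)."""
    if ld50 <= 0 or ed50 <= 0:
        raise ValueError("LD50 and ED50 must be positive")
    return ld50 / ed50


# Hypothetical doses chosen only to illustrate the arithmetic.
example_ti = therapeutic_index(ld50=500.0, ed50=25.0)
print(f"therapeutic index = {example_ti:g}")
```

A larger value indicates a wider margin between the effective dose and the lethal dose.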

[00214] The data obtained from cell culture assays and animal studies can be used
in formulating a range of dosage for use in humans. The dosage of such compounds
lies preferably within a range of circulating concentrations that include the ED50 with
little or no toxicity. The dosage can vary within this range depending upon the
dosage form employed and the route of administration utilized. For any compound
used in the method of the invention, the therapeutically effective dose can be
estimated initially from cell culture assays. A dose can be formulated in animal
models to achieve a circulating plasma concentration range that includes the IC50 (i.e.,
the concentration of the test compound which achieves a half-maximal inhibition of
symptoms) as determined in cell culture. Such information can be used to more
accurately determine useful doses in humans. Levels in plasma can be measured, for
example, by high performance liquid chromatography.

[00215] The pharmaceutical composition can be administered at various intervals
and over different periods of time as required, e.g., one time per week for between
about 1 to 10 weeks, between 2 to 8 weeks, between about 3 to 7 weeks, about 4, 5, or
6 weeks, etc. For certain conditions such as HIV it may be necessary to administer
the therapeutic composition on an indefinite basis to keep the disease under control.
The skilled artisan will appreciate that certain factors can influence the dosage and
timing required to effectively treat a subject, including but not limited to the severity
of the disease or disorder, previous treatments, the general health and/or age of the
subject, and other diseases present. Generally, treatment of a subject with a lentiviral
vector as described herein can include a single treatment or, in many cases, can
include a series of treatments.

[00216] Exemplary doses for administration of gene therapy vectors are known in
the art. It is furthermore understood that appropriate doses of a lentiviral vector that
provides a template for synthesis of one or more RNAs that self-hybridize or
hybridize with each other to form an shRNA or siRNA may, in general, depend upon
the potency of the siRNA or shRNA and may optionally be tailored to the particular
recipient, for example, through administration of increasing doses until a preselected
desired response is achieved. It is understood that the specific dose level for any
particular animal subject may depend upon a variety of factors including the activity
of the specific compound employed, the age, body weight, general health, gender, and
diet of the subject, the time of administration, the route of administration, the rate of
excretion, any drug combination, and the degree of expression or activity to be
modulated.

[00217] Lentiviral gene therapy vectors can be delivered to a subject by, for
example, intravenous injection, local administration, or by stereotactic injection (see
e.g., Chen et al. (1994) Proc. Natl. Acad. Sci. USA 91:3054-3057). In certain
embodiments of the invention the vectors may be delivered orally or inhalationally
and may be encapsulated or otherwise manipulated to protect them from degradation,
enhance uptake into tissues or cells, etc. The pharmaceutical preparation can include
the lentiviral vector in an acceptable diluent, or can comprise a slow release matrix in
which the lentiviral vector is imbedded. Alternatively, where the vector can be
produced intact from recombinant cells, as is the case for retroviral or lentiviral
vectors as described herein, the pharmaceutical preparation can include one or more
cells which produce the vectors.

[00218] Inventive pharmaceutical compositions can be included in a container, 
pack, or dispenser together with instructions for administration.

Exemplification

[00219] Example 1: Generation of pLentiLox Vectors
This example describes generation of the pLentiLox family of vectors. Unless
otherwise indicated, standard molecular biology techniques were generally performed
in accordance with guidance found in Current Protocols in Molecular Biology,
edition as of 2001; or in Sambrook, Russell, and Sambrook, Molecular Cloning: A
Laboratory Manual, 3rd ed., Cold Spring Harbor Laboratory Press, Cold Spring
Harbor, 2001, or according to instructions provided by the manufacturer of the
relevant reagents or kits. It is noted that a variety of different approaches to
generating the constructs described below as well as alternative sources for the
elements incorporated into the constructs may be employed. In particular, the
sequence information provided herein enables one of ordinary skill in the art to
chemically synthesize part or all of the constructs, thus offering considerable
flexibility.

[00220] Characterization of pBFGW. Generation of the pLentiLox vector family
involved extensive modification of the plasmid pBFGW. Accordingly, the first step
was a thorough characterization of this plasmid. pBFGW is a third generation
lentiviral plasmid based upon the pCDNA3.1/Zeo plasmid (Invitrogen) that was
incompletely characterized and lacked sequence information. pBFGW is a member
of the same vector family as pFUGW (24) but contains a beta-actin/CMV hybrid
promoter rather than a ubiquitin promoter. Lentiviral elements in pBFGW are
derived from HIV-1. We sequenced this plasmid in its entirety. The sequence is
presented as SEQ ID NO: 1. A restriction map of pBFGW was generated based upon
the sequencing information, verified (for several enzymes) by digestion and
agarose gel electrophoresis, and is shown in Figure 1.

[00221] pBFGW includes a cassette for the generation of lentivirus inserted
downstream of the CMV promoter of pCDNA3.1/Zeo. The cassette consists of a 5'
self-inactivating (SIN) LTR, the required packaging sequence (Psi), the HIV FLAP
element (FLAP), a hybrid promoter consisting of beta-actin and CMV promoter
sequences, the open reading frame for enhanced Green Fluorescent Protein (EGFP),
the Woodchuck Hepatitis Regulatory Element (WRE), and the 3' SIN LTR. There
existed only three unique restriction sites for the introduction of transgenes and/or
promoters. In addition, pBFGW was 10,441 base pairs in length without the
introduction of a transgene. Plasmids of this size are more difficult to manipulate
than smaller plasmids. There was also no mechanism to eliminate transgene
expression after infection.

[00222] Elimination of elements between FLAP and WRE. pBFGW was
sequentially digested with PacI and EcoRI. The 7,930 bp fragment representing the
backbone was purified by gel purification. The overhanging ends were filled-in by
reaction with Pfu polymerase to generate blunt ends.

[00223] Introduction of a MCS-LoxP-MCS-LoxP cassette. The plasmid pBluescript
Lox (pBS-Lox) was created by amplifying two LoxP sites from an unrelated vector
(pML2MIG) by PCR. The two LoxP PCR products contained a 23 bp overlap region
(containing the restriction sites EcoRI, NotI, and HindIII) at their 5' and 3' ends
respectively. These two products were combined in a splicing overlap extension
(SOE) PCR reaction (Horton, RM, et al., Biotechniques, 8(5):528-35, 1990) using
standard PCR conditions to create the LoxP-MCS-LoxP cassette, which was cloned into
the filled-in HindIII and NotI sites of pBlueScript (KSII+) as a blunt ended fragment.
The following primers were used in the PCR reactions:

[00224] PCR product #1 was amplified with the following primers:
[00225] L1/5': 5'(tggtgggtacctagtggaacc)3' (SEQ ID NO: 32)
[00226] L1/3': 5'(aagcttaagcggccgcagaattcgtcgagggacctaataacgtatag)3' (SEQ ID
NO: 33)

[00227] PCR product #2 was amplified with the following primers:
[00228] L2/5': 5'(gaattctgcggccgcttaagcttggaacccttaatataacttcg)3' (SEQ ID NO:
34)
[00229] L2/3': 5'(cgcttcacgagattccagcag)3' (SEQ ID NO: 35)

[00230] The LoxP-MCS-LoxP cassette was used in the LentiLox cloning described
below. The second LoxP site of this cassette contained a three base pair deletion.
This deletion was not identified until later in the construction of the LentiLox vectors
(see below).

[00231] pBS-Lox was digested with Asp718. A 209 bp fragment was isolated by
gel purification. This fragment contained a 5' multiple cloning site (5' MCS), a LoxP
site, a 3' multiple cloning site (3' MCS), and a second LoxP site. The overhanging
ends of the 209 bp Asp718 fragment were filled-in with cloned Pfu polymerase, and
ligated into the 7,930 bp fragment of pBFGW. The orientation of the insertion was
determined both by restriction fragment length polymorphism and by sequencing. A
plasmid containing the MCS-LoxP-MCS-LoxP cassette with the expected sequence in
the correct orientation was named pLentiLox 1.0.
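
The 23 bp overlap between the inner SOE primers L1/3' (SEQ ID NO: 33) and L2/5'
(SEQ ID NO: 34) described above can be checked computationally. The following is
a minimal sketch rather than part of the original procedure; the helper names
revcomp and overlap_length are introduced here for illustration only.

```python
# Complement table for lowercase DNA.
COMPLEMENT = str.maketrans("acgt", "tgca")


def revcomp(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


# Inner primers as listed in the text.
L1_3 = "aagcttaagcggccgcagaattcgtcgagggacctaataacgtatag"  # SEQ ID NO: 33
L2_5 = "gaattctgcggccgcttaagcttggaacccttaatataacttcg"     # SEQ ID NO: 34


def overlap_length(primer_a: str, primer_b: str) -> int:
    """Largest n for which the first n nt of the two primers are exact
    reverse complements of one another (the SOE overlap region)."""
    best = 0
    for n in range(1, min(len(primer_a), len(primer_b)) + 1):
        if revcomp(primer_a[:n]) == primer_b[:n]:
            best = n
    return best


n_overlap = overlap_length(L1_3, L2_5)
overlap = L2_5[:n_overlap]

# Recognition sequences: EcoRI, NotI, HindIII.
sites = {"EcoRI": "gaattc", "NotI": "gcggccgc", "HindIII": "aagctt"}
sites_in_overlap = {name: (site in overlap) for name, site in sites.items()}

print(n_overlap, overlap, sites_in_overlap)
```

The same check can be applied to any pair of overlap-extension primers before they are ordered.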

[00232] Elimination of plasmid backbone restriction sites. Several of the sites
found within the two MCSs were present elsewhere in the pLentiLox 1.0 vector.
Specifically, NotI, ApaI, and XhoI each cut within the MCS cassette as well as once
elsewhere in the vector. In order to gain the use of these sites we intended to destroy
each site within the plasmid backbone.

[00233] pLL1.0 was partially digested with NotI under conditions of limiting
enzyme activity (0.0625 units of enzyme per microgram of pLL1.0 incubated at 37
degrees Celsius for twenty minutes). The 8,142 bp band that represented linearized
pLL1.0 was isolated via agarose gel electrophoresis followed by gel extraction. This
linearized fragment was phosphorylated on its 5' ends with T4 Polynucleotide Kinase
(PNK). The overhanging ends of this molecule were then filled-in with cloned Pfu to
destroy the NotI site. The ends were ligated together to circularize the molecule. To
determine whether a NotI site had been destroyed and to determine which NotI site
had been destroyed, the plasmid was digested with NotI and with PstI. Destruction of
the NotI site in the plasmid backbone yielded fragments of 7830 bp and 312 bp
whereas destruction of the MCS NotI site yielded fragments of 7253 bp and 889 bp,
respectively. We accidentally chose a plasmid in which the MCS NotI site was
destroyed and named it pLentiLox 1.1 (pLL1.1). This was later remedied (see
below). It should be noted that the creation of pLL1.1 was problematic due to
recombination within the vector resulting in large deletions of required sequences. To
verify the presence of an intact backbone it was necessary to perform an additional
restriction digest. An enzyme that cut in three places was used, and the digestion
pattern of pLL1.1 was compared with that of pLL1.0 to make sure that the two bands
representing the backbone were the same size.

[00234] We designed a strategy to destroy the ApaI site that would also eliminate a
2197 bp fragment between the 3' SIN LTR and the pUC ori that we deemed non-essential
for lentiviral production. This ApaI-PciI fragment contained a BGH
polyadenylation site, an SV40 promoter/ori, the Zeomycin resistance gene, and an
SV40 polyadenylation site. We digested pLL1.1 with PciI to linearize the plasmid.
The linearized plasmid was then digested with a limiting amount of ApaI (between
0.25 units and 2 units per microgram of linearized pLL1.1 for twenty minutes at room
temperature). A 5,945 bp fragment representing a single cut with ApaI adjacent to the
3' LTR was isolated by agarose gel electrophoresis followed by gel purification. The
gel purified fragment was phosphorylated with PNK, filled in with cloned Pfu, and
circularized by ligation. The ligated DNA was digested with StuI prior to
transformation into bacteria. Digestion with StuI was expected to specifically cut
plasmid that contains the 2197 bp fragment that had been eliminated, and thus was
used to select against contamination with uncut pLL1.1 vector. The elimination of
the 2197 bp fragment and the destruction of the PciI and ApaI sites were verified by
restriction digest, and a correct plasmid was identified and named pLentiLox 1.2 (pLL1.2).
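
The fragment sizes quoted above for the NotI/PstI diagnostic digest and for the
ApaI-PciI deletion can be cross-checked with simple arithmetic. The sketch below is
illustrative only; the function classify_digest and its 50 bp tolerance are
hypothetical conveniences, not part of the procedure described.

```python
PLL1_0_LENGTH_BP = 8142  # linearized pLL1.0, as stated in the text

# Expected NotI + PstI patterns, largest fragment first.
EXPECTED_PATTERNS = {
    "backbone NotI site destroyed": (7830, 312),
    "MCS NotI site destroyed": (7253, 889),
}

# Each pattern should account for the whole plasmid.
patterns_sum_to_plasmid = all(
    sum(fragments) == PLL1_0_LENGTH_BP for fragments in EXPECTED_PATTERNS.values()
)


def classify_digest(observed_bp, tolerance_bp=50):
    """Match observed band sizes to one of the expected patterns.

    tolerance_bp is an assumed allowance for gel sizing error.
    """
    observed = sorted(observed_bp, reverse=True)
    for label, expected in EXPECTED_PATTERNS.items():
        if len(observed) == len(expected) and all(
            abs(o - e) <= tolerance_bp for o, e in zip(observed, expected)
        ):
            return label
    return "unrecognized pattern"


# Removing the 2197 bp ApaI-PciI fragment should leave the 5,945 bp fragment.
DELETED_BP = 2197
remaining_bp = PLL1_0_LENGTH_BP - DELETED_BP

print(patterns_sum_to_plasmid, classify_digest([900, 7250]), remaining_bp)
```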
[00235] To destroy the XhoI site, pLL1.2 was cut with limiting amounts of XhoI
(0.0625 units per microgram of plasmid for twenty minutes at 37 degrees Celsius). A
5,947 bp fragment representing single-cut linearized pLL1.2 was isolated via agarose
gel electrophoresis and gel purification. The fragment was 5' phosphorylated with
PNK, filled in with cloned Pfu, recircularized with ligase, and transformed into
bacteria. Destruction of the correct XhoI site was verified by restriction digest. A
correct plasmid was identified and named pLentiLox 1.3 (pLL1.3).
[00236] Expansion of the 5' MCS. The 5' MCS was intended for the insertion of
promoter sequences, among other purposes. After destruction of the sites mentioned
above, two unique cloning sites (ApaI and XhoI) remained in this MCS. We derived a
list of restriction enzymes that failed to cut pLL1.3 to generate a list of candidate sites
to engineer into an expanded 5' MCS. We then designed complementary
oligonucleotides to allow us to introduce XbaI, HpaI, NheI, and PacI sites between
the ApaI and XhoI sites. The oligonucleotides were designed to include two
nucleotides between adjacent restriction sites. The sequence of the sense
oligonucleotide was 5' cgctctagacggttaacgcgctagccgttaattaagcc 3' (SEQ ID NO: 11).
The antisense oligonucleotide was complementary to this sequence but contained an
additional four nucleotides at the 5' end to produce an XhoI overhang and four
nucleotides at the 3' end to produce an ApaI overhang. The antisense oligonucleotide
sequence was 5'-tcgaggcttaattaacggctagcgcgttaaccgtctagagcgggcc-3' (SEQ ID NO:
12). We chose restriction sites to include based upon the following criteria: (1)
inclusion of a site for a restriction enzyme that leaves a blunt end after cutting; (2)
inclusion of a restriction site that has an 8 bp recognition sequence; (3) inclusion of
sites for which enzymes are widely available; and (4) inclusion of sites for enzymes that
are known to be reliable cutters.
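
The design of the 5' MCS oligonucleotides (SEQ ID NO: 11 and 12) described above
can be verified with a short script. This is a sketch for illustration; the variable
names are introduced here and the script simply restates the design rules given in
the text (four sites, two-nucleotide spacers, XhoI and ApaI overhangs on the
antisense strand).

```python
COMPLEMENT = str.maketrans("acgt", "tgca")


def revcomp(seq: str) -> str:
    """Return the reverse complement of a lowercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


SENSE = "cgctctagacggttaacgcgctagccgttaattaagcc"             # SEQ ID NO: 11
ANTISENSE = "tcgaggcttaattaacggctagcgcgttaaccgtctagagcgggcc"  # SEQ ID NO: 12

# Sites in the order stated in the text, with their recognition sequences.
SITES = [
    ("XbaI", "tctaga"),
    ("HpaI", "gttaac"),
    ("NheI", "gctagc"),
    ("PacI", "ttaattaa"),
]

# Start position of each site in the sense oligonucleotide (-1 if absent).
positions = [(name, SENSE.find(site), len(site)) for name, site in SITES]

# Number of nucleotides between the end of one site and the start of the next.
spacers = [
    positions[i + 1][1] - (positions[i][1] + positions[i][2])
    for i in range(len(positions) - 1)
]

# The antisense core (without the 4 nt added at each end) should be the
# reverse complement of the sense oligonucleotide.
core_matches = ANTISENSE[4:-4] == revcomp(SENSE)
five_prime_extension = ANTISENSE[:4]   # stated to give the XhoI overhang
three_prime_extension = ANTISENSE[-4:]  # stated to give the ApaI overhang

print(positions, spacers, core_matches, five_prime_extension, three_prime_extension)
```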

[00237] pLL1.3 was digested sequentially with ApaI and XhoI. The digest was
then purified by Qiaquick PCR purification kit (Qiagen) to eliminate the small DNA
fragment between ApaI and XhoI. The fragment was then treated with Shrimp
Alkaline Phosphatase (SAP) to eliminate 5'-phosphate groups. The oligonucleotides
described above were synthesized, 5' phosphorylated, and PAGE-purified by IDT
Corp. (see Web site having URL www.idtna.com). 60 picomoles of each oligo were
annealed in annealing buffer (100 mM potassium acetate, 30 mM HEPES-KOH pH
7.4, 2 mM magnesium acetate) by incubation at 95 degrees for 4 minutes, followed by
70 degrees for 10 minutes, then slowly cooled (0.1 degrees/second) to 4 degrees, then
maintained at 4 degrees for 10 minutes. The annealed oligos were diluted and ligated
at an equimolar concentration with the linearized pLL1.3 vector. A plasmid
containing the engineered MCS was identified by restriction digest and named
pLL1.4.

[00238] It was at this time that we first realized that we had destroyed the NotI site
in the MCS rather than in the plasmid backbone (see above). The second NotI site
(adjacent to the LTR) was then destroyed in pLL1.4 as follows. pLL1.4 was digested
with NotI to linearize the plasmid. The ends were 5' phosphorylated with PNK and
were filled-in with cloned Pfu to blunt and destroy the NotI site. The plasmid was
recircularized by ligation and transformed into bacteria. We checked for destruction
of the NotI site by restriction digest and named the resulting plasmid pLentiLox 1.5
(pLL1.5).

[00239] We next sought to expand the 3' MCS. We designed primers to introduce
NsiI, SphI, SmaI/XmaI, AscI, and BamHI sites between the NotI and EcoRI sites.
We followed the same design criteria as described above. An additional criterion
was that no three consecutive nucleotides would generate a nonsense codon. This
would allow us to produce fusion proteins in which MCS sequence can remain
between the fused proteins while minimizing the likelihood of premature termination
of translation. In addition we wanted a minimum of sites in the MCS to be present in
the sequences encoding EGFP and dsRed2, which we intended to include in many
derivatives of our vectors (see below). We purposefully intended to make this MCS
more versatile than the 5' MCS since we anticipated that most applications of our
vector would require cloning into this MCS. The inclusion of an SphI site was
fortuitous. The two nucleotide spacer between the NsiI and SmaI/XmaI sites led to the
creation of an SphI site that overlaps these other two sites. More fortuitously, SphI is
a unique site in pLL1.5. The oligonucleotide primers incorporating the desired
restriction sites are as follows:

[00240] 3' MCS Sense: 5' ggccgccgatgcatgccccgggatggcgcgccatggatccgcg 3'
(SEQ ID NO: 13)

[00241] 3' MCS Antisense: 5' aattcgcggatccatggcgcgccatcccggggcatgcatcggc 3' 
(SEQ ID NO: 14) 
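
The additional design criterion for the 3' MCS, namely that no three consecutive
nucleotides generate a nonsense codon, can be tested directly on the sense
oligonucleotide (SEQ ID NO: 13). The following sketch checks the sense strand as
written, in all three reading frames, and confirms that the listed recognition
sequences are present. The function name is introduced here for illustration.

```python
STOP_CODONS = ("taa", "tag", "tga")

SENSE_3_MCS = "ggccgccgatgcatgccccgggatggcgcgccatggatccgcg"  # SEQ ID NO: 13

# Recognition sequences for the sites named in the text.
SITES = {
    "NsiI": "atgcat",
    "SphI": "gcatgc",
    "SmaI/XmaI": "cccggg",
    "AscI": "ggcgcgcc",
    "BamHI": "ggatcc",
}


def stop_codon_positions(seq: str):
    """Return every 0-based position where a stop codon begins, in any frame."""
    return [i for i in range(len(seq) - 2) if seq[i:i + 3] in STOP_CODONS]


stops = stop_codon_positions(SENSE_3_MCS)
site_positions = {name: SENSE_3_MCS.find(site) for name, site in SITES.items()}

print("stop codons at:", stops)
print("site start positions:", site_positions)
```

An empty list of stop codon positions indicates that the criterion is met for the strand examined.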

[00242] Because we had destroyed the NotI site that should have been present in
the pLL1.5 3' MCS we had to use a different strategy to insert the oligonucleotides
than was used for the 5' MCS. The pBS-Lox (described above) plasmid was digested
with NotI and EcoRI enzymes. The small DNA fragment that was liberated was
eliminated by purifying the linearized pBS-Lox backbone in a Qiaquick PCR
purification kit. The DNA was SAP treated. The 3' MCS oligonucleotides were
annealed (see above). 150 fmol of annealed oligonucleotides and cut pLL1.5 were
ligated together and transformed into bacteria. A plasmid containing the expanded 3'
MCS was identified by restriction digest and named pBS-Lox-MCS.
[00243] To insert the expanded MCS from pBS-Lox-MCS into pLL1.5, we
replaced the EcoRI-XhoI fragment from pLL1.5 (containing the improperly destroyed
NotI site) with the EcoRI-XhoI fragment from pBS-Lox-MCS (containing an intact
NotI site and the expanded 3' MCS). A plasmid containing the expanded MCS and
intact NotI in the pLL backbone was identified by restriction digest and was named
pLentiLox 2.0 (pLL2.0).

[00244] Production of useful pLL2.0 series vectors

[00245] We next sought to produce constructs that would be useful starting points
for many potential uses of our lentiviral vectors. One application in which there is
enormous interest is the generation of fusion proteins in which a protein of interest (or
a portion thereof) is fused with a fluorescent protein. In particular, EGFP and dsRed2
are fluorescent proteins that are well characterized and for which sequences are
widely available.

[00246] pLL2.1 was engineered to include the EGFP open reading frame. The
EGFP open reading frame was amplified from pEGFP-N1 (Clontech) to include a
5' NotI site and a 3' NsiI site. The oligonucleotides used were:

[00247] EGFP/5'NotI: 5'-cggcggccgcgccaccatggtgagcaagggc-3' (SEQ ID NO: 15)
[00248] EGFP/3'NsiI: 5'-cgatgcatcttgtacagctcgtccatgccg-3' (SEQ ID NO: 16)
[00249] The PCR product was isolated by agarose gel electrophoresis, gel purified,
and cloned into the NotI and NsiI sites of pLL2.0 to create pLL2.1.

[00250] pLL2.2 was engineered to include the dsRed2 open reading frame. The
dsRed2 open reading frame was amplified from pdsRed2-N1 (Clontech) to include a 5' NotI
site and a 3' NsiI site. The oligonucleotides used were:

[00251] dsRed2/5'NotI: 5'-cggcggccgcgccaccatggcctcctccgag-3' (SEQ ID NO: 17)
[00252] dsRed2/3'NsiI: 5'-cgatgcatcaggaacaggtggtggcggccc-3' (SEQ ID NO: 18)

[00253] The PCR product was isolated by agarose gel electrophoresis, gel purified,
and cloned into the NotI and NsiI sites of pLL2.0 to create pLL2.2.

[00254] Because the pLentiLox series has a self-inactivating 5' LTR, the provirus
has no endogenous 5' promoter activity. Therefore, it is necessary to include an
internal promoter to drive transgene expression. This makes the system compatible
with tissue-specific promoters. We chose to clone a ubiquitous and constitutive
promoter into our vector to create a transgenic system that should be active in most
eukaryotic cell types as well as in all the tissues of a mouse. The promoter we chose
was the Ubiquitin C promoter (UbC). We first attempted to clone UbC by PCR
amplification of the promoter from the pUB6/V5/His vector. However, the UbC
sequence was not robustly amplified via PCR (in several attempts). As a second
strategy we digested the pUB6/V5/His vector with BglII and HindIII, which generates
a fragment containing the UbC promoter. This fragment was isolated by agarose gel
electrophoresis and gel extraction. The fragment was filled-in with cloned Pfu
polymerase and 5' phosphorylated with PNK. This fragment was ligated into HpaI
digested pLL2.0, pLL2.1, and pLL2.2 to generate pLL2.3, pLL2.4, and pLL2.5
respectively. These plasmids were verified by restriction digest to contain the proper
insert. pLL2.4 and pLL2.5 were transfected into 293T cells and production of the
correct fluorescent protein was verified by visualizing the transfected cells under an
epifluorescent microscope 24 hours after transfection.

[00255] As described in the following example, we later discovered that one of the
loxP sites in the pLL2.0 series of vectors contained a deletion that rendered it
unusable. In order to generate the pLL3.0 series (which contain two wild type loxP
sites) from the pLL2.1-pLL2.7 series we cloned ApaI-EcoRI inserts of various sizes
(pLL2.1, 917 bp; pLL2.2, 875 bp; pLL2.3, 1417 bp; pLL2.4, 2,138 bp; pLL2.5, 2096 bp;
pLL2.6, 1519 bp; and pLL2.7, 1819 bp) from the pLL2.1-pLL2.7 vectors into the
5,831 bp ApaI-EcoRI backbone from pLL3.0 to create pLL3.1-pLL3.7.

[00256] Example 2: Generation of Lentiviral Vectors for RNAi
[00257] Modification of pLL2.0 for use in RNAi. In order to drive expression of an
RNAi-inducing stem-loop, we decided to incorporate a polIII promoter into a
pLentiLox series vector. In addition, we decided to incorporate a polII promoter to
drive expression of EGFP as a reporter. Because we were concerned that the
placement of a strong polII promoter near a polIII promoter might interfere with
polIII function we chose to place the polII-EGFP cassette between LoxP sites. This
would allow us to eliminate the polII promoter if we were failing to accumulate the
stem-loop RNA.

[00258] We first inserted a cassette to drive expression of EGFP. A DNA
fragment containing the CMV promoter upstream of the EGFP open reading frame
was amplified from pEGFP-C1. The oligonucleotides were selected to engineer a 5'
NotI site and 3' EcoRI site. The oligonucleotides used were:
[00259] 5'CMV/NotI: 5'-cggcggccgcgtggataaccgtattaccgccatg-3' (SEQ ID NO: 19)
[00260] 3'EGFP/stop/EcoRI: 5'-cggaattcctacttgtacagctcgtccatgccgag-3' (SEQ ID
NO: 20)

The PCR product was isolated by agarose gel electrophoresis and purified by
gel extraction. The fragment was cloned into the NotI and EcoRI sites of pLL2.0 to
create pLL2.6. This plasmid was tested by restriction digest and production of EGFP
was verified by transfection of pLL2.6 into 293T cells.

[00261] The insertion of the U6 promoter presented an additional challenge. We
needed to introduce a cloning site for the introduction of RNAi sequences. The U6
promoter includes sequences required for activity up until the +1 transcriptional start
site. Therefore, one cannot modify the sequences prior to -1 without incapacitating
U6. We did not want to introduce a site after +1 as that would dictate that the first
several nucleotides of the stem-loop would be derived from the restriction site. We
therefore engineered the U6 promoter to introduce an HpaI site that cuts at the -1
position of U6. The first three nucleotides of the HpaI site are present in the wild-type
U6. We had to alter the nucleotides at -1 to +2 in order to engineer an HpaI site. As
a result the U6 promoter is not functional when containing the HpaI site. However,
after digestion with HpaI and introduction of oligonucleotides to code for a stem-loop,
those oligonucleotides can re-generate the wild-type 3' end of the U6 promoter,
thereby restoring transcriptional activity. We engineered oligonucleotides to add an
XbaI site to the 5' end of the U6 promoter and HpaI, BstEII, and XhoI sites to the 3'
end. We cloned the amplified PCR product from the pmU6 plasmid and introduced
the product into the XbaI and XhoI sites of pLL2.6. The oligonucleotides used were:

[00262] 5' XbaI/U6: 5'-gctctagagatccgacgccgccatctctag-3' (SEQ ID NO: 21)
[00263] 3' XhoI/BstEII/HpaI/U6:
5'-gcctcgagggtcaccgcgcgttaacaaggcttttctccaaggg-3' (SEQ ID NO: 22)

[00264] The resulting plasmid was verified by both restriction digest and by
sequencing and was named pLL2.7.

[00265] Repair of LoxP and engineering of new restriction site

[00266] It was at this point that we recognized that the 3' LoxP site in the original
pBS-Lox contained a three nucleotide deletion that rendered it unusable. We decided
to fix the 3' LoxP site in the pLL2.0 plasmid and then use this plasmid backbone to
clone in sequences from pLL2.1-pLL2.7. The repair of the LoxP site gave us an
opportunity to engineer a new restriction site outside of the LoxP site (between the 3'
LoxP and the WRE). This site would give our plasmid series even greater flexibility
for engineering other additions such as IRES-GFP, or inducible expression systems.

[00267] Oligonucleotides were designed to amplify a fragment from pLL2.0 from
the EcoRI site to a PflMI site located within WRE. The 5' oligonucleotide extended
from the EcoRI site in the 3' MCS through the mutant LoxP site and into the region
between the LoxP and the WRE. This oligonucleotide was designed to add the
deleted nucleotides to the LoxP site and to create a PciI site immediately following
the LoxP site. The amplified DNA was inserted into the pLL2.0 backbone digested
with EcoRI and PflMI. The oligonucleotides used were:
[00268] 5'EcoRI/LoxFix/PciI:
5'-gcgaattcgtcgagggacctaataacttcgtatagcatacattatacgaagttatacatgtttaagggttccgg-3'
(SEQ ID NO: 23)

[00269] 3' PflMI/Rev: 5'-aaggagctgacaggtggtggcaatg-3' (SEQ ID NO: 24)
[00270] A plasmid was checked by sequencing for the presence of correct LoxP
and PciI sites and named pLL3.0.

[00271] In order to generate the pLL3.1-pLL3.7 series from the pLL2.1-pLL2.7
series we cloned ApaI-EcoRI inserts of various sizes (pLL2.1, 917 bp; pLL2.2, 875 bp;
pLL2.3, 1417 bp; pLL2.4, 2,138 bp; pLL2.5, 2096 bp; pLL2.6, 1519 bp; and pLL2.7,
1819 bp) from the pLL2.1-pLL2.7 vectors into the 5,831 bp ApaI-EcoRI backbone
from pLL3.0 to create pLL3.1-pLL3.7. All plasmids were verified by restriction digest.
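
The expected sizes of the pLL3.1-pLL3.7 plasmids follow from the backbone and
insert sizes listed above. The sketch below assumes that fragment sizes are simply
additive on ligation, which is our assumption and is not stated in the text; the
resulting totals are therefore estimates for use when checking restriction digests.

```python
BACKBONE_BP = 5831  # ApaI-EcoRI backbone from pLL3.0

# ApaI-EcoRI insert sizes from the pLL2.x series, as listed in the text.
INSERT_BP = {
    "pLL2.1": 917,
    "pLL2.2": 875,
    "pLL2.3": 1417,
    "pLL2.4": 2138,
    "pLL2.5": 2096,
    "pLL2.6": 1519,
    "pLL2.7": 1819,
}

# Map each pLL2.x donor to the corresponding pLL3.x product and its size.
expected_sizes = {
    donor.replace("pLL2.", "pLL3."): BACKBONE_BP + insert
    for donor, insert in INSERT_BP.items()
}

for name, size in expected_sizes.items():
    print(f"{name}: {size} bp")
```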

[00272] Example 3: Specific Silencing of Genes in T Cells using a Lentiviral
Vector

[00273] Materials and Methods

[00274] Cell culture: E10 and primary mouse splenocyte cultures were performed
as previously described (11). 293T cells (human fibroblasts) were cultured as
described (21). In vitro T-cell proliferation was performed on 200,000 activated
T-cells cultured in the presence/absence of increasing doses of IL2 (0 to 100 ng/ml) and
pulsed for 6 h with [3H]TdR to assay proliferation.

[00275] Oligonucleotide design. The following approach was used to design
oligonucleotides suitable for cloning into pLL3.7 vectors to generate vectors capable
of directing synthesis of shRNAs for gene silencing in this and the following
examples. As described above, we have engineered a multiple cloning site
immediately following the U6 promoter. An HpaI site leaves a blunt end prior to the -1
position in the promoter. The oligonucleotide design must incorporate a 5' T in
order to reconstitute the -1 nucleotide of U6. An XhoI site cuts downstream of the U6
start site. The following oligonucleotide format was used:
[00276] Sense oligonucleotide: 5' T-(GN18)-(TTCAAGAGA)-(81NC)-TTTTT,
where (81NC) denotes the reverse complement of (GN18).
[00277] Antisense oligonucleotide: Complement of sense but with additional
nucleotides at 5' end to generate XhoI overhang.

The loop sequence (TTCAAGAGA) (SEQ ID NO: 10) is based upon Brummelkamp
et al. (Science 2002).

Oligonucleotides with 5' phosphates and PAGE purified were ordered from Integrated
DNA Technologies (IDT), Coralville, IA.
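
The oligonucleotide format given above lends itself to a small design helper. The
following is a sketch under stated assumptions and is not the procedure actually
used: the function name design_shrna_oligos is hypothetical, the 5' extension TCGA
is assumed to be the XhoI-compatible overhang (the same extension used on the 5'
MCS antisense oligonucleotide of Example 1), and the terminator is written as
TTTTT exactly as in the generic format. The example target is the 19-nucleotide
stem taken from the CD8 sense oligonucleotide of this example.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")
LOOP = "TTCAAGAGA"        # loop sequence, SEQ ID NO: 10
TERMINATOR = "TTTTT"      # as written in the generic format
XHOI_OVERHANG = "TCGA"    # assumed 5' extension for the XhoI overhang


def revcomp(seq: str) -> str:
    """Return the reverse complement of an uppercase DNA sequence."""
    return seq.translate(COMPLEMENT)[::-1]


def design_shrna_oligos(target: str):
    """Build (sense, antisense) oligos in the format
    5' T-(GN18)-(loop)-(reverse complement of GN18)-TTTTT."""
    target = target.upper()
    if len(target) != 19 or target[0] != "G" or set(target) - set("ACGT"):
        raise ValueError("target must be 19 nt of A/C/G/T beginning with G")
    sense = "T" + target + LOOP + revcomp(target) + TERMINATOR
    antisense = XHOI_OVERHANG + revcomp(sense)
    return sense, antisense


# Stem sequence taken from the CD8 sense oligonucleotide (SEQ ID NO: 36).
sense, antisense = design_shrna_oligos("GCTACAACTACTACATGAC")
print(sense)
print(antisense)
```

Oligonucleotides produced this way should still be checked against the target transcript and the vector sequence before ordering.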

[00278] Generation of lentiviral transfer plasmids containing shRNAs targeted to
CD8.

[00279] Oligonucleotides having the following sequences were inserted into
pLL3.7 to produce lentiviral transfer plasmids capable of directing expression of an
shRNA targeted to CD8.

[00280] CD8 sense: 5'-tgctacaactactacatgacttcaagagagtcatgtagtagttgtagcttttttg-3'
(SEQ ID NO: 36)

[00281] CD8 antisense:
5'-gttacaaaaaagctacaactactacatgactctcttgaagtcatgtagtagttgtagca-3' (SEQ ID NO: 37)
[00282] The following protocol was used to clone oligonucleotides into pLL3.7 in
this and the following examples:
[00283] Oligos are resuspended in water at 60 pmol/µl and annealed as follows.
Annealing oligos:
1 µl Sense oligo
1 µl Antisense oligo
48 µl Annealing Buffer

Annealing Buffer Recipe:
100 mM K-acetate
30 mM HEPES-KOH pH 7.4
2 mM Mg-acetate

Incubate at 95° 4 min
70° 10 min
Decrease temperature to 4° slowly (0.1 °C/min)
Incubate at 4° 10 min

pLentiLox 3.7 is digested as follows: 
Digest 1-2 µg with XhoI and HpaI
Treat with SAP or with CIP 
Purify linearized fragment 
Estimate concentration 

Ligation is performed as follows: 

Ligate linearized product and annealed oligos at equimolar concentration. I typically
use 60 fmol of each component in a final volume of 10 µL.

Transformation is performed according to standard techniques. The use of an endA-
strain of E. coli, e.g., STBL-2 cells, is strongly recommended.

Clones are tested for the presence of inserts as follows : 

We have had success testing for insertion of the stem-loop sequence by either colony
PCR or restriction digest. Insertion causes a band shift of ~60 bp in an
XbaI/NotI fragment when compared to parental vector. This can be seen by 2%
agarose gel electrophoresis.

[00284] Electroporation: For electroporations, 10 µg of LentiLox plasmid were
added to prechilled 0.4-cm electrode gap cuvettes (Bio-Rad, Hercules, CA). E10 cells
(1.5 x 10^7) were resuspended to 3 x 10^7 cells/ml in cold serum-free RPMI, added to
the cuvettes, mixed, and pulsed once at 300 mV, 975 µF with a Gene Pulser
electroporator II (Bio-Rad). After electroporation, the cells were put into four wells of
a 24-well plate, each containing 1 ml of RPMI. Cell viability immediately after
electroporation was typically around 60%.

[00285] Flow cytometry: For flow cytometric analysis described in this and
subsequent examples, all cells were washed once in FACS buffer (PBS supplemented
with 2% FCS and 0.01% sodium azide), resuspended to 200 µl, and stained directly
with the appropriate antibodies. The stained cells were washed once, then
resuspended in 100 µl FACS buffer containing 5 µg/ml propidium iodide (PI).
Unstained and singly stained controls were included in every experiment. Cell data
were collected on a FACSCalibur flow cytometer (BD Biosciences, San Jose, CA),
and four-color analyses (GFP, PE, PI, and allophycocyanin) were done with
CellQuest software (BD Biosciences). All data were collected by analyses performed
on at least 2.5 x 10^5 PI-negative events (viable cells).

[00286] The following phycoerythrin (PE)-conjugated antibodies were used in this
and the following examples: anti-CD4 (clone RM4-5), anti-CD8α (clone 53-6.7),
anti-CD25 (clone PC81), anti-TCRβ (clone H57-597), anti-CD28 (clone 37.51) and
streptavidin. Allophycocyanin (APC)-conjugated anti-CD8α and biotin-conjugated
anti-CD3 were also used for analysis. All antibodies were from BD Pharmingen (San
Diego, California). All plots shown are gated for viable cells, which were isolated by
selecting PI-negative cells.

[00287] Northern blot analysis: For Northern blot analysis, cells were lysed with
Trizol reagent (Invitrogen), and total cellular RNA was prepared according to the
manufacturer's instructions. CD4/CD8 probe hybridization was performed as
described (11). For the small RNA Northern, total RNA (60 µg) was fractionated on a
10% denaturing polyacrylamide gel and transferred to nylon membrane. The
membrane was hybridized to a probe consisting of a 21 nt CD8 siRNA sense strand
5' end-labeled with 32P. A 5' radiolabeled oligonucleotide probe to 5S RNA was used
to confirm equal loading of RNA. The probe for the CD8 siRNA was taken exactly
from the sense strand in the pLL3.7 CD8 stem-loop as described in reference 11.
[00288] Results 

[00289] To determine whether lentiviral vectors could deliver shRNAs and silence
genes in mammalian cells, the pLL3.7 vector described in Example 2 that carries the
U6 RNA polymerase III (polIII) promoter (Fig. 17A) was used. This promoter is
known to efficiently transcribe small RNAs that silence gene expression (2, 26). The
pLL3.7 vector was also engineered to express EGFP as a reporter gene, permitting
infected cells to be tracked by flow cytometry. EGFP expression was driven by a
constitutive RNA polymerase II (polII) promoter derived from cytomegalovirus
(CMV) (Fig. 17B). This promoter is active in most mammalian tissues (27). The
CMV promoter of pLL3.7 was placed between LoxP sites to allow removal of this
genetic element if transcript levels from the U6 promoter were not sufficient to
silence gene expression due, for example, to possible promoter interference that might
decrease expression of shRNAs in infected cells.

[00290] To test whether pLL3.7 could be used to silence gene expression in
mammalian cells, the sequence for an shRNA predicted to target the T cell surface
molecule CD8 was introduced into this vector to generate pLL3.7 CD8 (Fig. 17B).
The CD8 shRNA duplex sequence (SEQ ID NO: 25: 5'-TGCTACAACTACTACATGAC-3'
when expressed in DNA format) was based on sequences that we had previously
characterized in the CD8+ E10 thymoma cell line (11, 28), and that we had shown
specifically downregulate CD8 in these cells when introduced as siRNAs. As a first
test we electroporated pLL3.7 CD8 or a pLL3.7 vector containing a stem loop targeted
to an unrelated sequence (CD25T, a stem loop targeted to CD25 but containing a
mutation resulting in an early termination site) into E10 cells, and quantified
expression of CD8 in transfected cells by flow cytometry. E10 cells that took up
pLL3.7 CD8 or pLL3.7 CD25T DNA could be identified by flow cytometry based on
their expression of GFP, i.e., cells that took up the vector became GFP-positive. As
shown in Figure 23, GFP+ E10 cells transfected with pLL3.7 CD8 (lower panel)
showed on average a 7-fold reduction in CD8 levels relative to cells transfected with
pLL3.7 CD25T (middle panel). This result demonstrated that pLL3.7 CD8 was able to
silence expression of CD8 in T cells. Since we could detect GFP+, CD8-silenced
cells, promoter interference did not present a major barrier to co-expression of
shRNAs and a reporter gene.

An aspect of the data that should be noted is that although Figure 23 appears
to suggest that higher levels of GFP correlate with decreased CD8 expression,
which would suggest that more copies of the lentiviral DNA result in greater
silencing, this is an artifact. When signal was subtracted to correct for overlap
between signals, more signal from the PE (CD8) channel was subtracted from the
GFP channel than should have been, which makes the cells with high GFP appear to
have less CD8. In reality, it appears that any expression from the U6 promoter,
regardless of the number of integrants or copies, leads to full silencing. A further
complication of this experiment is that GFP expression from the plasmid is
actually highest after 24 hrs. As a consequence, at 48 hours, there are many silenced
cells that have become GFP-negative.

[00291] Example 4: Production of Infectious Lentiviral Particles using Lentiviral
Transfer Plasmids Containing shRNA Sequences
[00292] Materials and Methods

[00293] Cell culture and lentivirus production. Cell culture was performed as in
Example 3. Lentiviral production was performed as described (24) using packaging
plasmids pMDL g/pRRE, pCMV VSV-G, and pRSV-REV, described in references 21
and 40.

[00294] Harvesting and titering lentivirus. Lentivirus was harvested and titered
according to the following protocol:

Harvesting:

1. Harvest supernatant from cells and spin at 25,000 rpm for 1.5 hrs.
2. Remove all liquid, add the desired volume of PBS (between 15 and 200 µl), and
allow to sit overnight at 4°C.
3. Pipette up and down ~20 times.
4. Use, or aliquot and flash-freeze in liquid N2 and store at -80°C.
Titering:

1. Plate 4 x 10^5 293T cells/well in a 6-well plate 12-24 hours prior to titering. It is
helpful to have an additional well as a negative control that you mock infect with
D10+polybrene but without virus.
2. Make a stock solution of D10 medium with 8 µg/ml polybrene.
3. Generate a 10-fold dilution series of virus in the D10+polybrene. Using
1.5 ml/well you should have 1, 0.1, 0.01, 0.001, 0.0001, and 0.00001 µl of virus/well.
4. Incubate at 37°C O/N. Replace media with fresh D10.
5. At least 48 hours after infection trypsinize cells for FACS analysis. (Trypsinize,
inactivate with media, spin, and resuspend in cold PBS.)
6. FACS analyze for EGFP expression and record the percentage of cells that are
EGFP positive.
7. Use a well that has between 0.1% and 10% of cells expressing EGFP to determine
titer.

Sample calculation assuming 1% infection from the well with 0.01 µl of virus: 0.01
(fraction of cells that are EGFP positive) x 4 x 10^5 = 4 x 10^3 positive cells.
4 x 10^3 x 100 (dilution factor) = 4 x 10^5 viral particles/µl.
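The dilution series and the sample calculation can be reproduced with a few lines of Python. This is an illustrative sketch only: the function names are hypothetical, and it assumes, as the sample calculation does, that the number of cells at the time of infection equals the number plated.

```python
# Illustrative titer calculation from EGFP flow cytometry data.
# Assumption: cells at infection == cells plated (4e5 per well).

CELLS_PLATED = 4e5


def dilution_series(top_ul: float = 1.0, steps: int = 6, factor: float = 10.0):
    """Virus volume per well (ul) for a serial dilution, highest first."""
    return [top_ul / factor ** i for i in range(steps)]


def in_countable_range(fraction_positive: float,
                       low: float = 0.001, high: float = 0.10) -> bool:
    """True if the well falls in the 0.1%-10% EGFP-positive window."""
    return low <= fraction_positive <= high


def titer_per_ul(fraction_positive: float, virus_ul: float,
                 cells: float = CELLS_PLATED) -> float:
    """Transducing units per ul: positive cells divided by virus volume."""
    return fraction_positive * cells / virus_ul


wells = dilution_series()
fraction = 0.01          # 1% EGFP-positive cells
volume = wells[2]        # the 0.01 ul well
if in_countable_range(fraction):
    print(f"titer: {titer_per_ul(fraction, volume):.1e} particles/ul")
```

Dividing by the virus volume is equivalent to multiplying by the dilution factor used in the sample calculation (1 / 0.01 µl = 100).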

In general at least 5 x 10^5 viral particles/µl should be used for embryo infections.

[00295] Transfection of 293 cells. The following protocol was used for 
transfection of 293 cells in this and the following examples. 

1. Plate 12 x 10^6 293T cells in 20 ml on a 15 cm plate 24 hours before
transfection. In general, two 15 cm plates per virus are used. It is highly
preferred that the cells be well-maintained and of relatively low passage
number.

2. Mix the following DNAs (preferably made using Endo-free Qiagen Kits
according to the manufacturer's instructions) in a FACS tube. The DNAs
should be in Endo-free TE at a concentration of 0.5 µg/µl.
For 3 plasmid system:
20 µg vector (transfer plasmid, e.g., pLL3.7 CD8)
10 µg pVSVG (envelope plasmid)
15 µg Δ8.9 (packaging plasmid)
For 4 plasmid system (recommended):
20 µg vector (transfer plasmid, e.g., pLL3.7 CD8)
10 µg pVSVG (envelope plasmid)
10 µg RSV-REV (plasmid supplies Rev protein)
10 µg pMDL g/p RRE (packaging plasmid)
The envelope plasmid, packaging plasmids, and Rev-supplying plasmid are
described in further detail in references 21, 24, 40, and 41.
Add 400 µl 1.25 M CaCl2 and 1.5 ml H2O and mix by tapping gently.
The following steps are done 1 plate at a time.

3. Add 2 ml of 2X HBS dropwise to DNA mixture while bubbling with a Pasteur
pipette. When finished, continue to bubble for 12-15 seconds.

4. Take plate of 293T cells out of the incubator (plate remains in incubator for as
long as possible), and add transfection mixture dropwise all over the plate. Gently
swirl plate from front to back, and return immediately to incubator.

5. 3.5 to 4 hours later, remove media, wash 2x with 10 ml warm PBS, add 20
ml warm D10 onto plate, and place in incubator.

6. 36-48 hours after transfection, harvest viral supernatant and spin at 2000 rpm,
7 min at 4°C in a 50 ml tube.

7. Filter viral supernatant through a 0.45 µm filter. Add 35 ml of filtered
supernatant to an ultracentrifuge tube. Balance tubes with additional media. Cover
tubes with a small piece of parafilm. (It is useful to titer some of the leftover
supernatant to determine if there is loss of virus during concentration.)

8. Spin tubes using a SW-28 rotor at 25,000 rpm, 90 min, 4°C. Decant liquid
and leave tube upside down on a kimwipe for 10 min. Aspirate remaining
media, being careful not to touch bottom of tube.

9. Add 15 µl cold PBS (for embryo infections, or any volume you wish) and
leave tube at 4°C O/N with no shaking.

10. To resuspend, hold tube at an angle and pipet fluid over pellet 20 times, being
careful not to touch pellet with tip. The pellet is not expected to be
resuspended after this is complete. This pellet does not contain virus and can
be discarded.

11. Aliquot or use virus. Virus should be aliquoted, flash-frozen in liquid nitrogen
and stored at -80°C. There should be no change in titer with freezing
concentrated virus. Avoid multiple freeze-thaws.
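The volumes in the calcium phosphate transfection mix follow directly from the amounts listed in step 2. The sketch below works them out for the 4 plasmid system. It is illustrative only: the variable names are hypothetical, and it assumes the plasmid stocks are exactly 0.5 µg/µl and that volumes are additive.

```python
# Illustrative volume bookkeeping for the 4 plasmid transfection mix.
# Assumptions: DNA stocks at 0.5 ug/ul; volumes are additive.

DNA_STOCK_UG_PER_UL = 0.5
FOUR_PLASMID_UG = {
    "transfer vector": 20,
    "pVSVG": 10,
    "RSV-REV": 10,
    "pMDL g/p RRE": 10,
}
CACL2_UL, CACL2_STOCK_M = 400.0, 1.25
WATER_UL = 1500.0
HBS_2X_UL = 2000.0

# Volume of each plasmid stock needed to deliver the listed mass.
dna_ul = {name: ug / DNA_STOCK_UG_PER_UL for name, ug in FOUR_PLASMID_UG.items()}
total_dna_ug = sum(FOUR_PLASMID_UG.values())
total_dna_ul = sum(dna_ul.values())

premix_ul = total_dna_ul + CACL2_UL + WATER_UL   # DNA + CaCl2 + water
final_ul = premix_ul + HBS_2X_UL                 # after adding 2X HBS

cacl2_premix_m = CACL2_STOCK_M * CACL2_UL / premix_ul
cacl2_final_m = CACL2_STOCK_M * CACL2_UL / final_ul
hbs_final_x = 2.0 * HBS_2X_UL / final_ul

print(f"total DNA: {total_dna_ug} ug in {total_dna_ul:.0f} ul")
print(f"premix: {premix_ul:.0f} ul, CaCl2 {cacl2_premix_m:.3f} M")
print(f"final: {final_ul:.0f} ul, CaCl2 {cacl2_final_m:.3f} M, HBS {hbs_final_x:.1f}X")
```

Under these assumptions the DNA/CaCl2 premix comes to 2.0 ml, so adding 2 ml of 2X HBS brings the buffer to 1X in a 4 ml mixture for the 20 ml plate.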

[00296] Results 

[00297] To test whether pLL3.7 vectors could be used to generate infectious
lentivirus particles, 293T cells were transfected with pLL3.7 or pLL3.7 CD8, and
three lentiviral packaging plasmids developed by Miyoshi et al. (29). Our initial
concern was that a lentiviral construct containing shRNA sequences would itself be
susceptible to RNA interference, thus preventing generation of viral (RNA) genomes
and production of infectious viral particles. We failed to find any evidence of this
type of auto-inhibition. We were able to generate viral stocks of pLL3.7 and pLL3.7
CD8 that could infect mouse fibroblast 3T3 cells, as gauged by GFP expression, and
the titre of these viral stocks was always qualitatively similar. These results
demonstrated that lentiviruses capable of mediating gene silencing can be generated
efficiently. In other words, shRNAs generated by lentiviral vectors can target
endogenous cellular transcripts but do not inhibit production of viral RNAs carrying
the same sequences.

[00298] Example 5: Stable Silencing of Genes and Production of Processed 
shRNAs in T Cells by a Lentiviral Vector 
[00299] Materials and Methods 

[00300] Cell culture, lentivirus production, and lentivirus infection. Cell culture
was performed as in Example 2. Lentiviral production and infection were performed
as described (24) in this and following examples unless otherwise indicated. For
some experiments, sorted populations of infected E10 cells were maintained in
long-term culture. E10 cells infected with pLL3.7 CD8 (CD8 RNAi virus) were sorted
four days after infection for GFP expression and low CD8 expression, while cells
infected with control virus were sorted for GFP expression only. Each population was
cultured for 1 month and analyzed for CD8 expression via flow cytometry at weekly
intervals.
[00301] Results 

[00302] We examined whether the LentiLox system could be used to silence gene
expression upon infection of mammalian cells. To accomplish this, E10 cells were
infected with pLL3.7 and pLL3.7 CD8 viruses. A low viral titre was used so that only
a fraction of cells became infected, as gauged by GFP expression (Fig. 18A). This
allowed us to follow the fate of both infected and non-infected cells simultaneously.
Infected (GFP+) cells on average showed a 16-fold reduction of CD8 expression.
Figure 2a shows density plots indicating the expression levels of CD4 and CD8 48
hours post-infection.
[00303] Expression of CD8 and of other surface proteins was measured at 48 hours
and at various times thereafter. Inhibition of CD8 expression in E10 cells infected
with pLL3.7 CD8 was specific since levels of other surface proteins were not altered
(Fig. 18B). Furthermore, in a subline of E10 cells engineered to express human CD8,
which differs from mouse CD8 by 4 out of 19 nucleotides in the region that we
targeted, we showed that pLL3.7 CD8 selectively reduced expression of the mouse
protein. Figure 24 shows expression of human CD8 in 3 populations of cells that
either were (lower panels) or were not (upper panels) transfected with a construct
encoding human CD8 (hCD8). The leftmost panels show expression of human CD8
in wild type ES cells, illustrating expression of hCD8 in transfected cells (lower left
panel; cells below bar display hCD8 expression). The middle panels show expression
of human CD8 in a population of ES cells that were infected with pLL3.7 CD8 and
displayed effective silencing of mouse CD8 (low CD8). As shown in the lower
middle panel, this population of cells did not display silencing of human CD8. The
rightmost panels show expression of human CD8 in a population of ES cells that were
infected with pLL3.7 CD8 and did not display extensive silencing of mouse CD8
(high CD8). As shown in the lower right panel, this population of cells also did not
display silencing of human CD8. These data show that an shRNA targeted to mouse
CD8 does not silence human CD8, confirming the specificity of silencing. Cells
infected with a control virus (pLL3.7) or a virus that expressed a neuron-specific
shRNA (pLL3.7 Mena+) showed no decrease in CD8 levels (Fig. 18A and data not
shown).

[00304] To confirm that the decrease in surface CD8 expression seen in E10 cells
resulted from mRNA degradation, we assayed CD8 mRNA levels in sorted (GFP+)
cell populations infected either with a control virus (pLL3.7) or CD8 RNAi virus
(pLL3.7 CD8) (Fig. 19A). Consistent with results showing significant reduction of
CD8 protein levels, the amount of CD8 transcripts in E10 cells infected with pLL3.7
CD8 was 13-fold lower than in controls (Fig. 19B). The same cells expressed normal
amounts of CD4 transcripts (Fig. 19B), confirming the specificity of the RNAi
knockdown. We also examined whether the shRNAs encoded by pLL3.7 CD8 were
processed into the approximately 21 nucleotide-long RNAs reported to mediate RNAi
(12) by blotting cellular RNA extracts with a probe directed against the anti-sense
strand of the CD8 stem loop. Only the pLL3.7 CD8 infected cells produced CD8
siRNAs (Fig. 19C). The dominant population of siRNAs detected was 21 bases in
length, although siRNA species 1-2 bases longer were also present (Fig. 19C). No
precursor shRNA was visible on the autoradiogram.
[00305] To test the stability of lentivirus-mediated RNAi in mammalian cells, we
followed expression of CD8 in long-term cultures of E10 cells infected with pLL3.7
or pLL3.7 CD8. Cells were sorted based on GFP and CD8 expression levels four days
after infection with lentivirus, and subsequently monitored for expression of these
proteins weekly. No change in expression of CD8 was observed and these cells
remained uniformly GFP positive, demonstrating that RNAi mediated by the
integrated lentivirus was stable (Fig. 19A). In each experiment a small fraction (2 to
15%) of E10 cells infected with pLL3.7 CD8 showed no evidence of gene silencing,
maintaining wild type CD8 expression (Fig. 19A). This was not necessarily the result
of a low copy number of integrated viruses or poor expression of viral genes since
some of these cells expressed very high levels of GFP (Fig. 19A). As shown in the
Northern blot in Figure 25, we were able to determine that these cells expressed little,
if any, shRNA directed against CD8, suggesting that the activity of the polIII
promoter was reduced.

[00306] Example 6: Functional Gene Silencing in Differentiated Mammalian Cells
Induced by Lentiviruses

[00307] Materials and Methods 

[00308] T-cell purification and stimulation. Cells were harvested from spleen and
lymph nodes. They were plated in RPMI with 10% FBS supplemented with 1 µg/ml
ova peptide. Cells were infected 24 and 48 hours after plating and analyzed 72 hours
after plating. This activation method yields >90% purity of T-cells.

[00309] Viral infection. Spin infection was performed as described for retrovirus
in van Parijs, L., et al., Immunity, 11:281, 1999 using 50 µl of concentrated lentivirus.
[00310] We tested whether the LentiLox-based RNAi system could be used to
silence gene expression in primary mammalian cells. In these experiments, we
purified CD8+ T cells from the spleens of OTI T-cell receptor (TCR) transgenic mice,
activated them with cognate peptide antigen, and then infected these cells with
pLL3.7 or pLL3.7 CD8. After three days in culture, the T cells were harvested and
analysed for GFP and CD8 expression by flow cytometry. Between 68 and 82% of
the cells were infected, as gauged by GFP expression (Fig. 4a). The infected (GFP+)
population reproducibly showed approximately a 14-fold reduction in CD8
expression, demonstrating that lentivirus-driven expression of shRNAs efficiently
silenced gene expression in primary T cells (Fig. 20A). This effect of pLL3.7 CD8
was specific since infected cells showed normal expression of other T cell surface
markers (Fig. 20A).

[00311] We next examined whether the degree of gene silencing achieved in
primary mammalian cells using the LentiLox system was functionally relevant. To
accomplish this we performed an experiment aimed at studying the biological effects
of targeting the IL-2 receptor (IL-2R) in T cells using lentivirus-mediated RNAi. IL-2
is an important growth factor for T cells, and T cells derived from mice that lack the
receptor for this cytokine fail to proliferate in vitro (31). To determine whether we
could phenocopy IL-2R-deficiency in primary T cells using lentivirus-mediated
RNAi, we designed an shRNA against the alpha chain of the IL-2 receptor (CD25) and
used this sequence to create pLL3.7 CD25. The shRNA sequences were as follows:
[00312] CD25 sense: 5'-tgcattcacctaatcggctgttcaagagacagccgattaggtgaatgcttttttg-3'
(SEQ ID NO: 38)
[00313] CD25 antisense:
5'-gtcaccaaaaaagcattcacctaatcggctgtctcttgaacagccgattaggtgaatgca-3' (SEQ ID NO: 39)

[00314] In most experiments between 70 and 85% of activated CD8+ TCR
transgenic T cells were infected with this virus (Fig. 20A). Infected cells on average
showed a 25-fold reduction in IL-2Rα chain expression, but expressed normal levels
of other surface markers (Fig. 20A). These cells were challenged with increasing
concentrations of IL-2, resulting in a 4- to 5-fold reduction in the response to this
cytokine (Figure 20B). Therefore, the LentiLox RNAi system can be used to
phenocopy loss-of-function in primary T cells.

[00315] Example 7: Functional Silencing of Genes in Embryonic Stem Cell-derived
Mice by a Lentiviral Vector
[00316] Materials and Methods 
[00317] Generation of lentiviral transfer plasmids containing shRNAs targeted to
Mena+, Beta-catenin, and p53. Oligonucleotides having the following sequences
were inserted into pLL3.7 as described above to produce lentiviral transfer plasmids
capable of directing expression of shRNAs targeted to Mena+, Beta-catenin, or p53
transcripts.

[00318] Mena+ sense:
5'-tgtcctgtgcctggcctactttcaagagaagtaggccaggcacaggactttttggaaac-3' (SEQ ID NO: 26)
[00319] Mena+ antisense:
[00320] 5'-tcgagttcccaaaaagtcctgtgcctggcctacttctcttgaaagtaggccaggcacaggaca-3'
(SEQ ID NO: 27)

[00321] Beta-catenin sense:
[00322] 5'-tgtccagcgcttggctgaacttcaagagtgttcagccaagcgctggactttttggaaa-3'
(SEQ ID NO: 28)

[00323] Beta-catenin antisense:
[00324] 5'-tcgatttccaaaaagtccagcgcttggctgaacactcttgaagttcagccaagcgctggaca-3'
(SEQ ID NO: 29)
[00325] p53 sense:
[00326] 5'-tggtctaagtggagcccttcgagtgttagaagcttgtgacactcggagggcttcacttgggcctttttggaaa-3'
(SEQ ID NO: 30)

[00327] p53 antisense:
[00328] 5'-tcgatttccaaaaaggcccaagtgaagccctccgagtgtcacaagcttctaacactcgaagggctccacttagacca-3'
(SEQ ID NO: 31)

[00329] ES Cells: AK7 ES cells were maintained and infected as described (23).
Clones of ES cells were picked, expanded, and analyzed by flow cytometry for GFP
expression. If the clone contained a mixed population of infected and uninfected
cells, the GFP+ population was purified by fluorescence activated cell sorting.
[00330] Production of transgenic mice: Transgenic mice were generated
essentially as described in reference 24.
[00331] Results 
[00332] A unique feature of lentivirus-based vectors is that they can stably express
transgenes in stem cells and are not silenced during development, allowing for the
efficient generation of transgenic mice (23, 24). We tested whether the LentiLox
RNAi system could be used to silence gene expression in stem cells, as well as
animals generated from these cells. To accomplish this we infected embryonic stem
cells with versions of the pLL3.7 vector that expressed shRNAs against CD8 (pLL3.7
CD8), Mena+ (pLL3.7 Mena+), or p53 (pLL3.7 p53).

[00333] We found that these vectors could efficiently infect embryonic stem cells,
and we were able to generate and maintain stable lines of infected ES cells (Figure 21A,
and data not shown).

[00334] To test whether ES cells infected with RNAi lentivirus were capable of
giving rise to progeny that showed gene silencing, we generated uniformly GFP+ ES
cell populations infected with pLL3.7 CD8, pLL3.7 Mena+, or the empty vector,
pLL3.7, by cell sorting. Ten to twelve of these cells were injected into day 3
blastocysts, which were subsequently implanted into pseudopregnant recipients. To
ensure that the lentivirus-infected ES cells contributed to immune tissues in the
chimeric offspring, we used RAG2-/- blastocysts in these experiments. This genetic
lesion blocks the development of B and T cells, so that any immune cells present in
the chimeric progeny must be derived from the injected (wild type) ES cells (32).
Using this approach we generated mice derived from ES cells infected with pLL3.7
CD8, pLL3.7 Mena+, and pLL3.7. The degree of chimerism in these animals was
between 50 and 90% as gauged by GFP fluorescence analysis of whole mice, as well
as dissected organs (Figure 21B and data not shown). This result demonstrated that
cells expressing siRNAs were not selected against during development and that these
cells were able to contribute to all tissues in the body.

[00335] In our chimeric mice, almost all cells in the lymphoid organs expressed
GFP, indicating that they were derived from the injected ES cells (Figure 21C and
data not shown). To examine whether lentivirus-mediated expression of shRNAs
resulted in the silencing of CD8 in vivo, we harvested the thymus and spleen of
7-day-old chimeric mice and stained the cells present in these organs with antibodies
against CD8 and CD4 according to standard techniques. We found that developing T
cells in the thymus of pLL3.7 CD8 mice showed a 7-fold reduction in CD8 expression
(Figure 21D). Furthermore, no mature CD8+ T cells were detected in this organ or in
the spleen (Figure 21D). In contrast, thymocytes from these mice showed normal
expression of CD4, and normal numbers of mature CD4+ T cells were found in their
lymphoid organs (Figure 21D). T cell differentiation and numbers were normal in
mice derived from pLL3.7 Mena+ and pLL3.7 infected ES cells (Figure 21D and data
not shown).

[00336] Example 8: Cre-mediated Extinguishing of a Transgene Expressed from a 
Lentiviral Vector 

[00337] This example demonstrates that introduction of Cre recombinase into cells
expressing a transgene from a lentiviral vector of the invention extinguishes
expression of the transgene.
[00338] Materials and Methods 

[00339] Expression of EGFP using a lentiviral vector. A 50% confluent 10 cm
plate of D7 cells (see Bear JE, et al., Cell 2000 Jun 23;101(7):717-2 for description of
cells and culture conditions) was infected with 100 µl of concentrated pLL3.7
B-catenin lentivirus, which expressed GFP as a transgene between two LoxP sites.
Infected cells (pLL3.7 B-catenin containing cells) were sorted based upon expression
of EGFP.

[00340] Introduction of Cre. A 50% confluent 6 cm plate of sorted D7 pLL3.7
B-catenin containing cells was infected with adenovirus expressing the Cre recombinase
(Jackson EL, et al., Genes Dev 2001 Dec 15;15(24):3243-8). 1x10 s infectious units
were used in the infection. Cells were expanded for 10 days to allow time for
expression of Cre protein, deletion of the loxP-CMVegfp-loxP cassette, and depletion
of EGFP protein pools. Cells were then analyzed by flow cytometry for expression of
EGFP. Cells were also sorted based upon loss of EGFP expression and expanded.
[00341] Results 

[00342] Figure 22A shows flow cytometric analysis of EGFP expression in cells
infected with an EGFP-expressing lentiviral vector in which the promoter and EGFP
coding sequences are floxed. Flow cytometry was performed at least 48 hours after
infection. The solid purple peaks in Figure 22A and 22B represent uninfected cells.
As shown in Figure 22A, ninety percent of the infected cells express EGFP. The
population of cells expressing EGFP is shown with a green line. Figure 22B shows
flow cytometric analysis of EGFP expression in cells infected with an
EGFP-expressing lentiviral vector 10 days after induction of Cre expression. The
percentage of EGFP-expressing cells is reduced to 49%. Figure 22C shows a direct
comparison between pLL3.7 infected D7 cells before (green line) and after (pink
line) Cre delivery. Induction of Cre extinguished expression of the floxed transgene in
approximately half the cells. (The adenoviral titer was not high enough to infect all
cells, thus cells in which the transgene was not extinguished were probably not
infected with adenovirus.)
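The statement that expression was extinguished in approximately half the cells can be related to the two percentages reported above. The short sketch below is an illustration only; it assumes the 90% and 49% figures describe comparable cell populations, and the function name is hypothetical.

```python
# Illustrative estimate of the fraction of EGFP-expressing cells that lost
# expression after Cre delivery. Assumes both percentages describe
# comparable populations.


def fraction_extinguished(before: float, after: float) -> float:
    """Fraction of initially positive cells that are no longer positive."""
    if not 0 < before <= 1 or not 0 <= after <= before:
        raise ValueError("expected 0 <= after <= before <= 1 and before > 0")
    return (before - after) / before


lost = fraction_extinguished(before=0.90, after=0.49)
print(f"EGFP extinguished in about {lost:.0%} of initially positive cells")
```

A higher adenoviral dose would be expected to raise this fraction, since the text attributes the remaining EGFP-positive cells to incomplete adenoviral infection.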
[00343] Sequences of pBFGW and pLL3.0 - pLL3.7. This section presents the
sequences of plasmids pBFGW and pLL3.0 - pLL3.7 in the form of GenBank files.
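A GenBank record of this kind can be sanity-checked with a short script. The Python sketch below is illustrative only: the coordinates, labels, base counts, and locus length are transcribed from the pBFGW record in this section, while the function names and the choice of checks are assumptions made for the example.

```python
# Illustrative consistency checks on the pBFGW feature table.
# Values are transcribed from the pBFGW record; helper names are hypothetical.

LOCUS_LENGTH = 10441
BASE_COUNT = {"A": 2414, "C": 2697, "G": 2911, "T": 2419, "OTHER": 0}

# (feature key, start, end, note), 1-based inclusive coordinates
FEATURES = [
    ("promoter", 212, 816, "CMV 1"),
    ("gene", 9448, 10308, "AmpR"),
    ("rep_origin", 8630, 9303, "pUC"),
    ("polyA_signal", 6452, 6666, "BGH pA"),
    ("gene", 7613, 7987, "ZeoR"),
    ("polyA_signal", 8117, 8246, "SV40 pA"),
    ("rep_origin", 6729, 7142, "f1 origin"),
    ("promoter", 7205, 7530, "SV40 ori"),
    ("LTR", 835, 1509, "5' LTR"),
    ("misc_feature", 1533, 2390, "psi sequence"),
    ("misc_feature", 2416, 2593, "FLAP"),
    ("promoter", 2612, 2974, "CMV 2"),
    ("promoter", 2798, 4229, "Beta actin promoter"),
    ("intron", 4234, 4327, "beta globin intron"),
    ("gene", 4373, 5089, "EGFP"),
    ("misc_feature", 5132, 5721, "WRE"),
    ("misc_feature", 5737, 6426, "3' LTR"),
]


def feature_length(start: int, end: int) -> int:
    """Length of a feature given 1-based inclusive coordinates."""
    return end - start + 1


def overlapping_pairs(features):
    """Return (note_a, note_b) for every pair of features that overlap."""
    ordered = sorted(features, key=lambda f: f[1])
    pairs = []
    for i, a in enumerate(ordered):
        for b in ordered[i + 1:]:
            if b[1] <= a[2]:  # b starts before a ends
                pairs.append((a[3], b[3]))
    return pairs


print("base count matches locus length:", sum(BASE_COUNT.values()) == LOCUS_LENGTH)
for key, start, end, note in sorted(FEATURES, key=lambda f: f[1]):
    print(f"{start:>6}..{end:<6} {feature_length(start, end):>5} bp  {key:<13} {note}")
print("overlapping features:", overlapping_pairs(FEATURES))
```

With the coordinates as transcribed, the only overlap reported is between the second CMV element and the beta actin promoter, which may reflect how the composite promoter was annotated.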
[00344] pBFGW 

LOCUS       PBFGW.GB    10441 BP DS-DNA   CIRCULAR  SYN       23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV 1"
     gene            9448..10308
                     /note="AmpR"
     rep_origin      8630..9303
                     /note="pUC"
     polyA_signal    6452..6666
                     /note="BGH pA"
     gene            7613..7987
                     /note="ZeoR"
     polyA_signal    8117..8246
                     /note="SV40 pA"
     rep_origin      6729..7142
                     /note="f1 origin"
     promoter        7205..7530
                     /note="SV40 ori"
     LTR             835..1509
                     /note="5' LTR"
     misc_feature    1533..2390
                     /note="psi sequence"
     misc_feature    2416..2593
                     /note="FLAP"
     promoter        2612..2974
                     /note="CMV 2"
     promoter        2798..4229
                     /note="Beta actin promoter"
     intron          4234..4327
                     /note="beta globin intron"
     gene            4373..5089
                     /note="EGFP"
     misc_feature    5132..5721
                     /note="WRE"
     misc_feature    5737..6426
                     /note="3' LTR"
BASE COUNT     2414 A   2697 C   2911 G   2419 T      0 OTHER
ORIGIN

        1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA ATCTGCTCTG
       61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC GCTGAGTAGT
      121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC ATGAAGAATC
      181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT ACGCGTTGAC
      241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT CATAGCCCAT
      301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA CCGCCCAACG
      361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA ATAGGGACTT
      421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA GTACATCAAG
      481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG CCCGCCTGGC
      541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC TACGTATTAG
      601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT GGATAGCGGT
      661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT TTGTTTTGGC
      721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG ACGCAAATGG
      781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG TACTGGGTCT
      841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA CCCACTGCTT
      901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT GTTGTGTGAC
      961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC TAGCAGTGGC
     1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA CGCAGGACTC
     1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT ACGCCAAAAA
     1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT ATTAAGCGGG
     1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG AAAAAATATA
     1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT AATCCTGGCC
     1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA TCCCTTCAGA
     1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT TGTGTGCATC
     1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA GAGCAAAACA
     1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG CTGATCTTCA GACCTGGAGG AGGAGATATG
     1561 AGGGACAATT GGAGAAGTGA ATTATATAAA TATAAAGTAG TAAAAATTGA ACCATTAGGA
     1621 GTAGCACCCA CCAAGGCAAA GAGAAGAGTG GTGCAGAGAG AAAAAAGAGC AGTGGGAATA
     1681 GGAGCTTTGT TCCTTGGGTT CTTGGGAGCA GCAGGAAGCA CTATGGGCGC AGCGTCAATG
     1741 ACGCTGACGG TACAGGCCAG ACAATTATTG TCTGGTATAG TGCAGCAGCA GAACAATTTG
     1801 CTGAGGGCTA TTGAGGCGCA ACAGCATCTG TTGCAACTCA CAGTCTGGGG CATCAAGCAG
     1861 CTCCAGGCAA GAATCCTGGC TGTGGAAAGA TACCTAAAGG ATCAACAGCT CCTGGGGATT
     1921 TGGGGTTGCT CTGGAAAACT CATTTGCACC ACTGCTGTGC CTTGGAATGC TAGTTGGAGT
     1981 AATAAATCTC TGGAACAGAT TTGGAATCAC ACGACCTGGA TGGAGTGGGA CAGAGAAATT
2041 AACAATTACA CAAGCTTAAT ACACTCCTTA ATTGAAGAAT CGCAAAACCA
GCAAGAAAAG

2101 AATGAACAAG AATTATTGGA ATTAGATAAA TGGGCAAGTT TGTGGAATTG
GTTTAACATA

2161 ACAAATTGGC TGTGGTATAT AAAATTATTC ATAATGATAG TAGGAGGCTT
GGTAGGTTTA

2221 AGAATAGTTT TTGCTGTACT TTCTATAGTG AATAGAGTTA GGCAGGGATA
TTCACCATTA

2281 TCGTTTCAGA CCCACCTCCC AACCCCGAGG GGACCCGACA GGCCCGAAGG
AATAGAAGAA

2341 GAAGGTGGAG AGAGAGACAG AGACAGATCC ATTCGATTAG TGAACGGATC
GGCACTGCGT

2401 GCGCCAATTC TGCAGACAAA TGGCAGTATT CATCCACAAT TTTAAAAGAA
AAGGGGGGAT

2461 TGGGGGGTAC AGTGCAGGGG AAAGAATAGT AGACATAATA GCAACAGACA
TACAAACTAA

2521 AGAATTACAA AAACAAATTA CAAAAATTCA AAATTTTCGG GTTTATTACA
GGGACAGCAG

2581 AGATCCAGTT TGGTTAATTA ACTGCAGGAA TCTAGTTATT AATAGTAATC
AATTACGGGG

2641 TCATTAGTTC ATAGCCCATA TATGGAGTTC CGCGTTACAT AACTTACGGT
AAATGGCCCG

2701 CCTGGCTGAC CGCCCAACGA CCCCCGCCCA TTGACGTCAA TAATGACGTA
TGTTCCCATA

2761 GTAACGCCAA TAGGGACTTT CCATTGACGT CAATGGGTGG AGTATTTACG
GTAAACTGCC

2821 CACTTGGCAG TACATCAAGT GTATCATATG CCAAGTACGC CCCCTATTGA
CGTCAATGAC

2881 GGTAAATGGC CCGCCTGGCA TTATGCCCAG TACATGACCT TATGGGACTT
TCCTACTTGG

2941 CAGTACATCT ACGTATTAGT CATCGCTATT ACCATGGTCG AGGTGAGCCC
CACGTTCTGC

3001 TTCACTCTCC CCATCTCCCC CCCCTCCCCA CCCCCAATTT TGTATTTATT
TATTTTTTAA

3061 TTATTTTGTG CAGCGATGGG GGCGGGGGGG GGGGGGGGGC GCGCGCCAGG
CGGGGCGGGG

3121 CGGGGCGAGG GGCGGGGCGG GGCGAGGCGG AGAGGTGCGG CGGCAGCCAA
TCAGAGCGGC

3181 GCGCTCCGAA AGTTTCCTTT TATGGCGAGG CGGCGGCGGC GGCGGCCCTA
TAAAAAGCGA

3241 AGCGCGCGGC GGGCGGGGAG TCGCTGCGAC GCTGCCTTCG CCCCGTGCCC
CGCTCCGCCG

3301 CCGCCTCGCG CCGCCCGCCC CGGCTCTGAC TGACCGCGTT ACTCCCACAG
GTGAGCGGGC

3361 GGGACGGCCC TTCTCCTCCG GGCTGTAATT AGCGCTTGGT TTAATGACGG
CTTGTTTCTT

3421 TTCTGTGGCT GCGTGAAAGC CTTGAGGGGC TCCGGGAGGG CCCTTTGTGC
GGGGGGAGCG

3481 GCTCGGGGGG TGCGTGCGTG TGTGTGTGCG TGGGGAGCGC CGCGTGCGGC
TCCGCGCTGC

3541 CCGGCGGCTG TGAGCGCTGC GGGCGCGGCG CGGGGCTTTG TGCGCTCCGC
AGTGTGCGCG

3601 AGGGGAGCGC GGCCGGGGGC GGTGCCCCGC GGTGCGGGGG GGGCTGCGAG
GGGAACAAAG

3661 GCTGCGTGCG GGGTGTGTGC GTGGGGGGGT GAGCAGGGGG TGTGGGCGCG
TCGGTCGGGC

3721 TGCAACCCCC CCTGCACCCC CCTCCCCGAG TTGCTGAGCA CGGCCCGGCT
TCGGGTGCGG
3781 GGCTCCGTAC GGGGCGTGGC GCGGGGCTCG CCGTGCCGGG CGGGGGGTGG
CGGCAGGTGG

3841 GGGTGCCGGG CGGGGCGGGG CCGCCTCGGG CCGGGGAGGG CTCGGGGGAG
GGGCGCGGCG

3901 GCCCCCGGAG CGCCGGCGGC TGTCGAGGCG CGGCGAGCCG CAGCCATTGC
CTTTTATGGT

3961 AATCGTGCGA GAGGGCGCAG GGACTTCCTT TGTCCCAAAT CTGTGCGGAG
CCGAAATCTG

4021 GGAGGCGCCG CCGCACCCCC TCTAGCGGGC GCGGGGCGAA GCGGTGCGGC
GCCGGCAGGA

4081 AGGAAATGGG CGGGGAGGGC CTTCGTGCGT CGCCGCGCCG CCGTCCCCTT
CTCCCTCTCC

4141 AGCCTCGGGG CTGTCCGCGG GGGGACGGCT GCCTTCGGGG GGGACGGGGC
AGGGCGGGGT

4201 TCGGCTTCTG GCGTGTGACC GGCGGCTCTA GAGCCTCTGC TAACCATGTT
CATGCCTTCT

4261 TCTTTTTCCT ACAGCTCCTG GGCAACGTGC TGGTTATTGT GCTGTCTCAT
CATTTTGGCA

4321 AAGAATTGAT TTGATACCGC GGGCCCGGGA TCCCCGGGTA CCGGTCGCCA
CCATGGTGAG

4381 CAAGGGCGAG GAGCTGTTCA CCGGGGTGGT GCCCATCCTG GTCGAGCTGG
ACGGCGACGT

4441 AAACGGCCAC AAGTTCAGCG TGTCCGGCGA GGGCGAGGGC GATGCCACCT
ACGGCAAGCT

4501 GACCCTGAAG TTCATCTGCA CCACCGGCAA GCTGCCCGTG CCCTGGCCCA
CCCTCGTGAC

4561 CACCCTGACC TACGGCGTGC AGTGCTTCAG CCGCTACCCC GACCACATGA
AGCAGCACGA

4621 CTTCTTCAAG TCCGCCATGC CCGAAGGCTA CGTCCAGGAG CGCACCATCT
TCTTCAAGGA

4681 CGACGGCAAC TACAAGACCC GCGCCGAGGT GAAGTTCGAG GGCGACACCC
TGGTGAACCG

4741 CATCGAGCTG AAGGGCATCG ACTTCAAGGA GGACGGCAAC ATCCTGGGGC
ACAAGCTGGA

4801 GTACAACTAC AACAGCCACA ACGTCTATAT CATGGCCGAC AAGCAGAAGA
ACGGCATCAA

4861 GGTGAACTTC AAGATCCGCC ACAACATCGA GGACGGCAGC GTGCAGCTCG
CCGACCACTA

4921 CCAGCAGAAC ACCCCCATCG GCGACGGCCC CGTGCTGCTG CCCGACAACC
ACTACCTGAG

4981 CACCCAGTCC GCCCTGAGCA AAGACCCCAA CGAGAAGCGC GATCACATGG
TCCTGCTGGA

5041 GTTCGTGACC GCCGCCGGGA TCACTCTCGG CATGGACGAG CTGTACAAGT
AAAGCGGCCG

5101 CGACTCTAGA ATTCGATATC AAGCTTATCG ATAATCAACC TCTGGATTAC
AAAATTTGTG

5161 AAAGATTGAC TGGTATTCTT AACTATGTTG CTCCTTTTAC GCTATGTGGA
TACGCTGCTT

5221 TAATGCCTTT GTATCATGCT ATTGCTTCCC GTATGGCTTT CATTTTCTCC
TCCTTGTATA

5281 AATCCTGGTT GCTGTCTCTT TATGAGGAGT TGTGGCCCGT TGTCAGGCAA
CGTGGCGTGG

5341 TGTGCACTGT GTTTGCTGAC GCAACCCCCA CTGGTTGGGG CATTGCCACC
ACCTGTCAGC

5401 TCCTTTCCGG GACTTTCGCT TTCCCCCTCC CTATTGCCAC GGCGGAACTC
ATCGCCGCCT

5461 GCCTTGCCCG CTGCTGGACA GGGGCTCGGC TGTTGGGCAC TGACAATTCC
GTGGTGTTGT
5521 CGGGGAAATC ATCGTCCTTT CCTTGGCTGC TCGCCTGTGT TGCCACCTGG
ATTCTGCGCG

5581 GGACGTCCTT CTGCTACGTC CCTTCGGCCC TCAATCCAGC GGACCTTCCT
TCCCGCGGCC

5641 TGCTGCCGGC TCTGCGGCCT CTTCCGCGTC TTCGCCTTCG CCCTCAGACG
AGTCGGATCT

5701 CCCTTTGGGC CGCCTCCCCG CATCGATACC GTCGACCTCG AGACCTAGAA
AAACATGGAG

5761 CAATCACAAG TAGCAATACA GCAGCTACCA ATGCTGATTG TGCCTGGCTA
GAAGCACAAG

5821 AGGAGGAGGA GGTGGGTTTT CCAGTCACAC CTCAGGTACC TTTAAGACCA
ATGACTTACA

5881 AGGCAGCTGT AGATCTTAGC CACTTTTTAA AAGAAAAGGG GGGACTGGAA
GGGCTAATTC

5941 ACTCCCAACG AAGACAAGAT ATCCTTGATC TGTGGATCTA CCACACACAA
GGCTACTTCC

6001 CTGATTGGCA GAACTACACA CCAGGGCCAG GGATCAGATA TCCACTGACC
TTTGGATGGT

6061 GCTACAAGCT AGTACCAGTT GAGCAAGAGA AGGTAGAAGA AGCCAATGAA
GGAGAGAACA

6121 CCCGCTTGTT ACACCCTGTG AGCCTGCATG GGATGGATGA CCCGGAGAGA
GAAGTATTAG

6181 AGTGGAGGTT TGACAGCCGC CTAGCATTTC ATCACATGGC CCGAGAGCTG
CATCCGGACT

6241 GTACTGGGTC TCTCTGGTTA GACCAGATCT GAGCCTGGGA GCTCTCTGGC
TAACTAGGGA

6301 ACCCACTGCT TAAGCCTCAA TAAAGCTTGC CTTGAGTGCT TCAAGTAGTG
TGTGCCCGTC

6361 TGTTGTGTGA CTCTGGTAAC TAGAGATCCC TCAGACCCTT TTAGTCAGTG
TGGAAAATCT

6421 CTAGCAGGGC CCGTTTAAAC CCGCTGATCA GCCTCGACTG TGCCTTCTAG
TTGCCAGCCA

6481 TCTGTTGTTT GCCCCTCCCC CGTGCCTTCC TTGACCCTGG AAGGTGCCAC
TCCCACTGTC

6541 CTTTCCTAAT AAAATGAGGA AATTGCATCG CATTGTCTGA GTAGGTGTCA
TTCTATTCTG

6601 GGGGGTGGGG TGGGGCAGGA CAGCAAGGGG GAGGATTGGG AAGACAATAG
CAGGCATGCT

6661 GGGGATGCGG TGGGCTCTAT GGCTTCTGAG GCGGAAAGAA CCAGCTGGGG
CTCTAGGGGG

6721 TATCCCCACG CGCCCTGTAG CGGCGCATTA AGCGCGGCGG GTGTGGTGGT
TACGCGCAGC

6781 GTGACCGCTA CACTTGCCAG CGCCCTAGCG CCCGCTCCTT TCGCTTTCTT
CCCTTCCTTT

6841 CTCGCCACGT TCGCCGGCTT TCCCCGTCAA GCTCTAAATC GGGGGCTCCC
TTTAGGGTTC

6901 CGATTTAGTG CTTTACGGCA CCTCGACCCC AAAAAACTTG ATTAGGGTGA
TGGTTCACGT

6961 AGTGGGCCAT CGCCCTGATA GACGGTTTTT CGCCCTTTGA CGTTGGAGTC
CACGTTCTTT

7021 AATAGTGGAC TCTTGTTCCA AACTGGAACA ACACTCAACC CTATCTCGGT
CTATTCTTTT

7081 GATTTATAAG GGATTTTGCC GATTTCGGCC TATTGGTTAA AAAATGAGCT
GATTTAACAA

7141 AAATTTAACG CGAATTAATT CTGTGGAATG TGTGTCAGTT AGGGTGTGGA
AAGTCCCCAG

7201 GCTCCCCAGC AGGCAGAAGT ATGCAAAGCA TGCATCTCAA TTAGTCAGCA
ACCAGGTGTG
7261 GAAAGTCCCC AGGCTCCCCA GCAGGCAGAA GTATGCAAAG CATGCATCTC
AATTAGTCAG

7321 CAACCATAGT CCCGCCCCTA ACTCCGCCCA TCCCGCCCCT AACTCCGCCC
AGTTCCGCCC

7381 ATTCTCCGCC CCATGGCTGA CTAATTTTTT TTATTTATGC AGAGGCCGAG
GCCGCCTCTG

7441 CCTCTGAGCT ATTCCAGAAG TAGTGAGGAG GCTTTTTTGG AGGCCTAGGC
TTTTGCAAAA

7501 AGCTCCCGGG AGCTTGTATA TCCATTTTCG GATCTGATCA GCACGTGTTG
ACAATTAATC

7561 ATCGGCATAG TATATCGGCA TAGTATAATA CGACAAGGTG AGGAACTAAA
CCATGGCCAA

7621 GTTGACCAGT GCCGTTCCGG TGCTCACCGC GCGCGACGTC GCCGGAGCGG
TCGAGTTCTG

7681 GACCGACCGG CTCGGGTTCT CCCGGGACTT CGTGGAGGAC GACTTCGCCG
GTGTGGTCCG

7741 GGACGACGTG ACCCTGTTCA TCAGCGCGGT CCAGGACCAG GTGGTGCCGG
ACAACACCCT

7801 GGCCTGGGTG TGGGTGCGCG GCCTGGACGA GCTGTACGCC GAGTGGTCGG
AGGTCGTGTC

7861 CACGAACTTC CGGGACGCCT CCGGGCCGGC CATGACCGAG ATCGGCGAGC
AGCCGTGGGG

7921 GCGGGAGTTC GCCCTGCGCG ACCCGGCCGG CAACTGCGTG CACTTCGTGG
CCGAGGAGCA

7981 GGACTGACAC GTGCTACGAG ATTTCGATTC CACCGCCGCC TTCTATGAAA
GGTTGGGCTT

8041 CGGAATCGTT TTCCGGGACG CCGGCTGGAT GATCCTCCAG CGCGGGGATC
TCATGCTGGA

8101 GTTCTTCGCC CACCCCAACT TGTTTATTGC AGCTTATAAT GGTTACAAAT
AAAGCAATAG

8161 CATCACAAAT TTCACAAATA AAGCATTTTT TTCACTGCAT TCTAGTTGTG
GTTTGTCCAA

8221 ACTCATCAAT GTATCTTATC ATGTCTGTAT ACCGTCGACC TCTAGCTAGA
GCTTGGCGTA

8281 ATCATGGTCA TAGCTGTTTC CTGTGTGAAA TTGTTATCCG CTCACAATTC
CACACAACAT

8341 ACGAGCCGGA AGCATAAAGT GTAAAGCCTG GGGTGCCTAA TGAGTGAGCT
AACTCACATT

8401 AATTGCGTTG CGCTCACTGC CCGCTTTCCA GTCGGGAAAC CTGTCGTGCC
AGCTGCATTA

8461 ATGAATCGGC CAACGCGCGG GGAGAGGCGG TTTGCGTATT GGGCGCTCTT
CCGCTTCCTC

8521 GCTCACTGAC TCGCTGCGCT CGGTCGTTCG GCTGCGGCGA GCGGTATCAG
CTCACTCAAA

8581 GGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA
TGTGAGCAAA

8641 AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGCCGCGTTG CTGGCGTTTT
TCCATAGGCT

8701 CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC
GAAACCCGAC

8761 AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT
CTCCTGTTCC

8821 GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG
TGGCGCTTTC

8881 TCATAGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA
AGCTGGGCTG

8941 TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT
ATCGTCTTGA
9001 GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA
ACAGGATTAG

9061 CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA
ACTACGGCTA

9121 CACTAGAAGA ACAGTATTTG GTATCTGCGC TCTGCTGAAG CCAGTTACCT
TCGGAAAAAG

9181 AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT AGCGGTGGTT
TTTTTGTTTG

9241 CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA
TCTTTTCTAC

9301 GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA
TGAGATTATC

9361 AAAAAGGATC TTCACCTAGA TCCTTTTAAA TTAAAAATGA AGTTTTAAAT
CAATCTAAAG

9421 TATATATGAG TAAACTTGGT CTGACAGTTA CCAATGCTTA ATCAGTGAGG
CACCTATCTC

9481 AGCGATCTGT CTATTTCGTT CATCCATAGT TGCCTGACTC CCCGTCGTGT
AGATAACTAC

9541 GATACGGGAG GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG
ACCCACGCTC

9601 ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGCCGAGC
GCAGAAGTGG

9661 TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT TGCCGGGAAG
CTAGAGTAAG

9721 TAGTTCGCCA GTTAATAGTT TGCGCAACGT TGTTGCCATT GCTACAGGCA
TCGTGGTGTC

9781 ACGCTCGTCG TTTGGTATGG CTTCATTCAG CTCCGGTTCC CAACGATCAA
GGCGAGTTAC

9841 ATGATCCCCC ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA
TCGTTGTCAG

9901 AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA
ATTCTCTTAC

9961 TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG TACTCAACCA
AGTCATTCTG

10021 AGAATAGTGT ATGCGGCGAC CGAGTTGCTC TTGCCCGGCG TCAATACGGG
ATAATACCGC

10081 GCCACATAGC AGAACTTTAA AAGTGCTCAT CATTGGAAAA CGTTCTTCGG
GGCGAAAACT

10141 CTCAAGGATC TTACCGCTGT TGAGATCCAG TTCGATGTAA CCCACTCGTG
CACCCAACTG

10201 ATCTTCAGCA TCTTTTACTT TCACCAGCGT TTCTGGGTGA GCAAAAACAG
GAAGGCAAAA

10261 TGCCGCAAAA AAGGGAATAA GGGCGACACG GAAATGTTGA ATACTCATAC
TCTTCCTTTT

10321 TCAATATTAT TGAAGCATTT ATCAGGGTTA TTGTCTCATG AGCGGATACA
TATTTGAATG

10381 TATTTAGAAA AATAAACAAA TAGGGGTTCC GCGCACATTT CCCCGAAAAG
TGCCACCTGA

10441 C

//

(SEQ ID NO: 1)
[00345] pLL3.0

LOCUS       PLENTILOX    6027 BP DS-DNA   CIRCULAR   SYN   23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV promoter/enhancer 1"
     gene            5034..5894
                     /note="AmpR"
     rep_origin      4216..4889
                     /note="pUC"
     misc_recomb     2710..2743
                     /note="LoxP"
     misc_recomb     2827..2860
                     /note="LoxP"
     LTR             835..1509
                     /note="5' HIV R-U5-del gag (HIV NL4-3/454-1126)"
     misc_feature    1539..2396
                     /note="HIV RRE (HIV NL4-3/7622-8459)"
     misc_feature    2422..2599
                     /note="HIV Flap"
     misc_feature    2915..3504
                     /note="WRE element"
     LTR             3524..4213
                     /note="3' SIN LTR"
BASE COUNT     1612 A   1408 C   1518 G   1489 T      0 OTHER
ORIGIN

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA
ATCTGCTCTG

61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC
GCTGAGTAGT

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC
ATGAAGAATC

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT
ACGCGTTGAC

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT
CATAGCCCAT

301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA
CCGCCCAACG

361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA
ATAGGGACTT

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA
GTACATCAAG

481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG
CCCGCCTGGC

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC
TACGTATTAG

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT
GGATAGCGGT

661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT
TTGTTTTGGC

721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG
ACGCAAATGG
781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG
TACTGGGTCT

841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA
CCCACTGCTT

901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT
GTTGTGTGAC

961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC
TAGCAGTGGC

1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA
CGCAGGACTC

1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT
ACGCCAAAAA

1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT
ATTAAGCGGG

1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG
AAAAAATATA

1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT
AATCCTGGCC

1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA
TCCCTTCAGA

1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT
TGTGTGCATC

1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA
GAGCAAAACA

1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC
TGGAGGAGGA

1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA
AATTGAACCA

1621 TTAGGAGTAG CACCCACCAA GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA
AAGAGCAGTG

1681 GGAATAGGAG CTTTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT
GGGCGCAGCG

1741 TCAATGACGC TGACGGTACA GGCCAGACAA TTATTGTCTG GTATAGTGCA
GCAGCAGAAC

1801 AATTTGCTGA GGGCTATTGA GGCGCAACAG CATCTGTTGC AACTCACAGT
CTGGGGCATC

1861 AAGCAGCTCC AGGCAAGAAT CCTGGCTGTG GAAAGATACC TAAAGGATCA
ACAGCTCCTG

1921 GGGATTTGGG GTTGCTCTGG AAAACTCATT TGCACCACTG CTGTGCCTTG
GAATGCTAGT

1981 TGGAGTAATA AATCTCTGGA ACAGATTTGG AATCACACGA CCTGGATGGA
GTGGGACAGA

2041 GAAATTAACA ATTACACAAG CTTAATACAC TCCTTAATTG AAGAATCGCA
AAACCAGCAA

2101 GAAAAGAATG AACAAGAATT ATTGGAATTA GATAAATGGG CAAGTTTGTG
GAATTGGTTT

2161 AACATAACAA ATTGGCTGTG GTATATAAAA TTATTCATAA TGATAGTAGG
AGGCTTGGTA

2221 GGTTTAAGAA TAGTTTTTGC TGTACTTTCT ATAGTGAATA GAGTTAGGCA
GGGATATTCA

2281 CCATTATCGT TTCAGACCCA CCTCCCAACC CCGAGGGGAC CCGACAGGCC
CGAAGGAATA

2341 GAAGAAGAAG GTGGAGAGAG AGACAGAGAC AGATCCATTC GATTAGTGAA
CGGATCGGCA

2401 CTGCGTGCGC CAATTCTGCA GACAAATGGC AGTATTCATC CACAATTTTA
AAAGAAAAGG

2461 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA
CAGACATACA
2521 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT
ATTACAGGGA

2581 CAGCAGAGAT CCAGTTTGGT TAGTACCGGG CCCGCTCTAG ACGGTTAACG
CGCTAGCCGT

2641 TAATTAAGCC TCGAGGTCGA CGGTATCGAT AAGCTCGCTT CACGAGATTC
CAGCAGGTCG

2701 AGGGACCTAA TAACTTCGTA TAGCATACAT TATACGAAGT TATATTAAGG
GTTCCAAGCT

2761 TAAGCGGCCG CCGATGCATG CCCCGGGATG GCGCGCCATG GATCCGCGAA
TTCGTCGAGG

2821 GACCTAATAA CTTCGTATAG CATACATTAT ACGAAGTTAT ACATGTTTAA
GGGTTCCGGT

2881 TCCACTAGGT ACAATTCGAT ATCAAGCTTA TCGATAATCA ACCTCTGGAT
TACAAAATTT

2941 GTGAAAGATT GACTGGTATT CTTAACTATG TTGCTCCTTT TACGCTATGT
GGATACGCTG

3001 CTTTAATGCC TTTGTATCAT GCTATTGCTT CCCGTATGGC TTTCATTTTC
TCCTCCTTGT

3061 ATAAATCCTG GTTGCTGTCT CTTTATGAGG AGTTGTGGCC CGTTGTCAGG
CAACGTGGCG

3121 TGGTGTGCAC TGTGTTTGCT GACGCAACCC CCACTGGTTG GGGCATTGCC
ACCACCTGTC

3181 AGCTCCTTTC CGGGACTTTC GCTTTCCCCC TCCCTATTGC CACGGCGGAA
CTCATCGCCG

3241 CCTGCCTTGC CCGCTGCTGG ACAGGGGCTC GGCTGTTGGG CACTGACAAT
TCCGTGGTGT

3301 TGTCGGGGAA ATCATCGTCC TTTCCTTGGC TGCTCGCCTG TGTTGCCACC
TGGATTCTGC

3361 GCGGGACGTC CTTCTGCTAC GTCCCTTCGG CCCTCAATCC AGCGGACCTT
CCTTCCCGCG

3421 GCCTGCTGCC GGCTCTGCGG CCTCTTCCGC GTCTTCGCCT TCGCCCTCAG
ACGAGTCGGA

3481 TCTCCCTTTG GGCCGCCTCC CCGCATCGAT ACCGTCGACC TCGATCGAGA
CCTAGAAAAA

3541 CATGGAGCAA TCACAAGTAG CAATACAGCA GCTACCAATG CTGATTGTGC
CTGGCTAGAA

3601 GCACAAGAGG AGGAGGAGGT GGGTTTTCCA GTCACACCTC AGGTACCTTT
AAGACCAATG

3661 ACTTACAAGG CAGCTGTAGA TCTTAGCCAC TTTTTAAAAG AAAAGGGGGG
ACTGGAAGGG

3721 CTAATTCACT CCCAACGAAG ACAAGATATC CTTGATCTGT GGATCTACCA
CACACAAGGC

3781 TACTTCCCTG ATTGGCAGAA CTACACACCA GGGCCAGGGA TCAGATATCC
ACTGACCTTT

3841 GGATGGTGCT ACAAGCTAGT ACCAGTTGAG CAAGAGAAGG TAGAAGAAGC
CAATGAAGGA

3901 GAGAACACCC GCTTGTTACA CCCTGTGAGC CTGCATGGGA TGGATGACCC
GGAGAGAGAA

3961 GTATTAGAGT GGAGGTTTGA CAGCCGCCTA GCATTTCATC ACATGGCCCG
AGAGCTGCAT

4021 CCGGACTGTA CTGGGTCTCT CTGGTTAGAC CAGATCTGAG CCTGGGAGCT
CTCTGGCTAA

4081 CTAGGGAACC CACTGCTTAA GCCTCAATAA AGCTTGCCTT GAGTGCTTCA
AGTAGTGTGT

4141 GCCCGTCTGT TGTGTGACTC TGGTAACTAG AGATCCCTCA GACCCTTTTA
GTCAGTGTGG

4201 AAAATCTCTA GCAGCATGTG AGCAAAAGGC CAGCAAAAGG CCAGGAACCG
TAAAAAGGCC
4261 GCGTTGCTGG CGTTTTTCCA TAGGCTCCGC CCCCCTGACG AGCATCACAA
AAATCGACGC

4321 TCAAGTCAGA GGTGGCGAAA CCCGACAGGA CTATAAAGAT ACCAGGCGTT
TCCCCCTGGA

4381 AGCTCCCTCG TGCGCTCTCC TGTTCCGACC CTGCCGCTTA CCGGATACCT
GTCCGCCTTT

4441 CTCCCTTCGG GAAGCGTGGC GCTTTCTCAT AGCTCACGCT GTAGGTATCT
CAGTTCGGTG

4501 TAGGTCGTTC GCTCCAAGCT GGGCTGTGTG CACGAACCCC CCGTTCAGCC
CGACCGCTGC

4561 GCCTTATCCG GTAACTATCG TCTTGAGTCC AACCCGGTAA GACACGACTT
ATCGCCACTG

4621 GCAGCAGCCA CTGGTAACAG GATTAGCAGA GCGAGGTATG TAGGCGGTGC
TACAGAGTTC

4681 TTGAAGTGGT GGCCTAACTA CGGCTACACT AGAAGAACAG TATTTGGTAT
CTGCGCTCTG

4741 CTGAAGCCAG TTACCTTCGG AAAAAGAGTT GGTAGCTCTT GATCCGGCAA
ACAAACCACC

4801 GCTGGTAGCG GTGGTTTTTT TGTTTGCAAG CAGCAGATTA CGCGCAGAAA
AAAAGGATCT

4861 CAAGAAGATC CTTTGATCTT TTCTACGGGG TCTGACGCTC AGTGGAACGA
AAACTCACGT

4921 TAAGGGATTT TGGTCATGAG ATTATCAAAA AGGATCTTCA CCTAGATCCT
TTTAAATTAA

4981 AAATGAAGTT TTAAATCAAT CTAAAGTATA TATGAGTAAA CTTGGTCTGA
CAGTTACCAA

5041 TGCTTAATCA GTGAGGCACC TATCTCAGCG ATCTGTCTAT TTCGTTCATC
CATAGTTGCC

5101 TGACTCCCCG TCGTGTAGAT AACTACGATA CGGGAGGGCT TACCATCTGG
CCCCAGTGCT

5161 GCAATGATAC CGCGAGACCC ACGCTCACCG GCTCCAGATT TATCAGCAAT
AAACCAGCCA

5221 GCCGGAAGGG CCGAGCGCAG AAGTGGTCCT GCAACTTTAT CCGCCTCCAT
CCAGTCTATT

5281 AATTGTTGCC GGGAAGCTAG AGTAAGTAGT TCGCCAGTTA ATAGTTTGCG
CAACGTTGTT

5341 GCCATTGCTA CAGGCATCGT GGTGTCACGC TCGTCGTTTG GTATGGCTTC
ATTCAGCTCC

5401 GGTTCCCAAC GATCAAGGCG AGTTACATGA TCCCCCATGT TGTGCAAAAA
AGCGGTTAGC

5461 TCCTTCGGTC CTCCGATCGT TGTCAGAAGT AAGTTGGCCG CAGTGTTATC
ACTCATGGTT

5521 ATGGCAGCAC TGCATAATTC TCTTACTGTC ATGCCATCCG TAAGATGCTT
TTCTGTGACT

5581 GGTGAGTACT CAACCAAGTC ATTCTGAGAA TAGTGTATGC GGCGACCGAG
TTGCTCTTGC

5641 CCGGCGTCAA TACGGGATAA TACCGCGCCA CATAGCAGAA CTTTAAAAGT
GCTCATCATT

5701 GGAAAACGTT CTTCGGGGCG AAAACTCTCA AGGATCTTAC CGCTGTTGAG
ATCCAGTTCG

5761 ATGTAACCCA CTCGTGCACC CAACTGATCT TCAGCATCTT TTACTTTCAC
CAGCGTTTCT

5821 GGGTGAGCAA AAACAGGAAG GCAAAATGCC GCAAAAAAGG GAATAAGGGC
GACACGGAAA

5881 TGTTGAATAC TCATACTCTT CCTTTTTCAA TATTATTGAA GCATTTATCA
GGGTTATTGT

5941 CTCATGAGCG GATACATATT TGAATGTATT TAGAAAAATA AACAAATAGG
GGTTCCGCGC

6001 ACATTTCCCC GAAAAGTGCC ACCTGAC
(SEQ ID NO: 2)

[00346] pLL3.1

LOCUS       PLENTILOX    6748 BP DS-DNA   CIRCULAR   SYN   23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV promoter/enhancer 1"
     misc_recomb     3548..3581
                     /note="LoxP"
     gene            5755..6615
                     /note="AmpR"
     rep_origin      4937..5610
                     /note="pUC"
     misc_recomb     2710..2745
                     /note="LoxP"
     LTR             835..1509
                     /note="5' HIV R-U5-del gag (HIV NL4-3/454-1126)"
     misc_feature    1539..2396
                     /note="HIV RRE (HIV NL4-3/7622-8459)"
     misc_feature    2422..2599
                     /note="HIV Flap"
     misc_feature    3636..4225
                     /note="WRE element"
     LTR             4245..4934
                     /note="3' SIN LTR"
     gene            2772..3494
                     /note="EGFP"
BASE COUNT     1785 A   1651 C   1721 G   1591 T      0 OTHER
ORIGIN

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGCCCAT 

301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA
CCGCCCAACG 

361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 

481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
TACGTATTAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 

661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 
TTGTTTTGGC 
721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG
ACGCAAATGG

781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG
TACTGGGTCT

841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA
CCCACTGCTT

901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT
GTTGTGTGAC

961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC
TAGCAGTGGC

1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA
CGCAGGACTC

1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT
ACGCCAAAAA

1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT
ATTAAGCGGG

1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG
AAAAAATATA

1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT
AATCCTGGCC

1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA
TCCCTTCAGA

1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT
TGTGTGCATC

1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA
GAGCAAAACA

1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC
TGGAGGAGGA

1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA
AATTGAACCA

1621 TTAGGAGTAG CACCCACCAA GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA
AAGAGCAGTG

1681 GGAATAGGAG CTTTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT
GGGCGCAGCG

1741 TCAATGACGC TGACGGTACA GGCCAGACAA TTATTGTCTG GTATAGTGCA
GCAGCAGAAC

1801 AATTTGCTGA GGGCTATTGA GGCGCAACAG CATCTGTTGC AACTCACAGT
CTGGGGCATC

1861 AAGCAGCTCC AGGCAAGAAT CCTGGCTGTG GAAAGATACC TAAAGGATCA
ACAGCTCCTG

1921 GGGATTTGGG GTTGCTCTGG AAAACTCATT TGCACCACTG CTGTGCCTTG
GAATGCTAGT

1981 TGGAGTAATA AATCTCTGGA ACAGATTTGG AATCACACGA CCTGGATGGA
GTGGGACAGA

2041 GAAATTAACA ATTACACAAG CTTAATACAC TCCTTAATTG AAGAATCGCA
AAACCAGCAA

2101 GAAAAGAATG AACAAGAATT ATTGGAATTA GATAAATGGG CAAGTTTGTG
GAATTGGTTT

2161 AACATAACAA ATTGGCTGTG GTATATAAAA TTATTCATAA TGATAGTAGG
AGGCTTGGTA

2221 GGTTTAAGAA TAGTTTTTGC TGTACTTTCT ATAGTGAATA GAGTTAGGCA
GGGATATTCA

2281 CCATTATCGT TTCAGACCCA CCTCCCAACC CCGAGGGGAC CCGACAGGCC
CGAAGGAATA

2341 GAAGAAGAAG GTGGAGAGAG AGACAGAGAC AGATCCATTC GATTAGTGAA
CGGATCGGCA

2401 CTGCGTGCGC CAATTCTGCA GACAAATGGC AGTATTCATC CACAATTTTA
AAAGAAAAGG






2461 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA
CAGACATACA

2521 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT
ATTACAGGGA

2581 CAGCAGAGAT CCAGTTTGGT TAGTACCGGG CCCGCTCTAG ACGGTTAACG
CGCTAGCCGT

2641 TAATTAAGCC TCGAGGTCGA CGGTATCGAT AAGCTCGCTT CACGAGATTC
CAGCAGGTCG

2701 AGGGACCTAA TAACTTCGTA TAGCATACAT TATACGAAGT TATATTAAGG
GTTCCAAGCT

2761 TAAGCGGCCG CGCCACCATG GTGAGCAAGG GCGAGGAGCT GTTCACCGGG
GTGGTGCCCA

2821 TCCTGGTCGA GCTGGACGGC GACGTAAACG GCCACAAGTT CAGCGTGTCC
GGCGAGGGCG

2881 AGGGCGATGC CACCTACGGC AAGCTGACCC TGAAGTTCAT CTGCACCACC
GGCAAGCTGC

2941 CCGTGCCCTG GCCCACCCTC GTGACCACCC TGACCTACGG CGTGCAGTGC
TTCAGCCGCT

3001 ACCCCGACCA CATGAAGCAG CACGACTTCT TCAAGTCCGC CATGCCCGAA
GGCTACGTCC

3061 AGGAGCGCAC CATCTTCTTC AAGGAGGACG GCAACTACAA GACCCGCGCC
GAGGTGAAGT

3121 TCGAGGGCGA CACCCTGGTG AACCGCATCG AGCTGAAGGG CATCGACTTC
AAGGAGGACG

3181 GCAACATCCT GGGGCACAAG CTGGAGTACA ACTACAACAG CCACAACGTC
TATATCATGG

3241 CCGACAAGCA GAAGAACGGC ATCAAGGTGA ACTTCAAGAT CCGCCACAAC
ATCGAGGACG

3301 GCAGCGTGCA GCTCGCCGAC CACTACCAGC AGAACACCCC CATCGGCGAC
GGCCCCGTGC

3361 TGCTGCCCGA CAACCACTAC CTGAGCACCC AGTCCGCCCT GAGCAAAGAC
CCCAACGAGA

3421 AGCGCGATCA CATGGTCCTG CTGGAGTTCG TGACCGCCGC CGGGATCACT
CTCGGCATGG

3481 ACGAGCTGTA CAAGATGCAT GCCCCGGGAT GGCGCGCCAT GGATCCGCGA
ATTCGTCGAG

3541 GGACCTAATA ACTTCGTATA GCATACATTA TACGAAGTTA TACATGTTTA
AGGGTTCCGG

3601 TTCCACTAGG TACAATTCGA TATCAAGCTT ATCGATAATC AACCTCTGGA
TTACAAAATT

3661 TGTGAAAGAT TGACTGGTAT TCTTAACTAT GTTGCTCCTT TTACGCTATG
TGGATACGCT

3721 GCTTTAATGC CTTTGTATCA TGCTATTGCT TCCCGTATGG CTTTCATTTT
CTCCTCCTTG

3781 TATAAATCCT GGTTGCTGTC TCTTTATGAG GAGTTGTGGC CCGTTGTCAG
GCAACGTGGC

3841 GTGGTGTGCA CTGTGTTTGC TGACGCAACC CCCACTGGTT GGGGCATTGC
CACCACCTGT

3901 CAGCTCCTTT CCGGGACTTT CGCTTTCCCC CTCCCTATTG CCACGGCGGA
ACTCATCGCC

3961 GCCTGCCTTG CCCGCTGCTG GACAGGGGCT CGGCTGTTGG GCACTGACAA
TTCCGTGGTG

4021 TTGTCGGGGA AATCATCGTC CTTTCCTTGG CTGCTCGCCT GTGTTGCCAC
CTGGATTCTG

4081 CGCGGGACGT CCTTCTGCTA CGTCCCTTCG GCCCTCAATC CAGCGGACCT
TCCTTCCCGC

4141 GGCCTGCTGC CGGCTCTGCG GCCTCTTCCG CGTCTTCGCC TTCGCCCTCA
GACGAGTCGG
4201 ATCTCCCTTT GGGCCGCCTC CCCGCATCGA TACCGTCGAC CTCGATCGAG
ACCTAGAAAA

4261 ACATGGAGCA ATCACAAGTA GCAATACAGC AGCTACCAAT GCTGATTGTG
CCTGGCTAGA

4321 AGCACAAGAG GAGGAGGAGG TGGGTTTTCC AGTCACACCT CAGGTACCTT
TAAGACCAAT

4381 GACTTACAAG GCAGCTGTAG ATCTTAGCCA CTTTTTAAAA GAAAAGGGGG
GACTGGAAGG

4441 GCTAATTCAC TCCCAACGAA GACAAGATAT CCTTGATCTG TGGATCTACC
ACACACAAGG

4501 CTACTTCCCT GATTGGCAGA ACTACACACC AGGGCCAGGG ATCAGATATC
CACTGACCTT

4561 TGGATGGTGC TACAAGCTAG TACCAGTTGA GCAAGAGAAG GTAGAAGAAG
CCAATGAAGG

4621 AGAGAACACC CGCTTGTTAC ACCCTGTGAG CCTGCATGGG ATGGATGACC
CGGAGAGAGA

4681 AGTATTAGAG TGGAGGTTTG ACAGCCGCCT AGCATTTCAT CACATGGCCC
GAGAGCTGCA

4741 TCCGGACTGT ACTGGGTCTC TCTGGTTAGA CCAGATCTGA GCCTGGGAGC
TCTCTGGCTA

4801 ACTAGGGAAC CCACTGCTTA AGCCTCAATA AAGCTTGCCT TGAGTGCTTC
AAGTAGTGTG

4861 TGCCCGTCTG TTGTGTGACT CTGGTAACTA GAGATCCCTC AGACCCTTTT
AGTCAGTGTG

4921 GAAAATCTCT AGCAGCATGT GAGCAAAAGG CCAGCAAAAG GCCAGGAACC
GTAAAAAGGC

4981 CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC GAGCATCACA
AAAATCGACG

5041 CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA TACCAGGCGT
TTCCCCCTGG

5101 AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT ACCGGATACC
TGTCCGCCTT

5161 TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA TAGCTCACGC TGTAGGTATC
TCAGTTCGGT

5221 GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC CCCGTTCAGC
CCGACCGCTG

5281 CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA AGACACGACT
TATCGCCACT

5341 GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT GTAGGCGGTG
CTACAGAGTT

5401 CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGAACA GTATTTGGTA
TCTGCGCTCT

5461 GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT TGATCCGGCA
AACAAACCAC

5521 CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT ACGCGCAGAA
AAAAAGGATC

5581 TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT CAGTGGAACG
AAAACTCACG

5641 TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC ACCTAGATCC
TTTTAAATTA

5701 AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA ACTTGGTCTG
ACAGTTACCA

5761 ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA TTTCGTTCAT
CCATAGTTGC

5821 CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC TTACCATCTG
GCCCCAGTGC

5881 TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT TTATCAGCAA
TAAACCAGCC
5941 AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA TCCGCCTCCA
TCCAGTCTAT

6001 TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT AATAGTTTGC
GCAACGTTGT

6061 TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT GGTATGGCTT
CATTCAGCTC

6121 CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG TTGTGCAAAA
AAGCGGTTAG

6181 CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC GCAGTGTTAT
CACTCATGGT

6241 TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC GTAAGATGCT
TTTCTGTGAC

6301 TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG CGGCGACCGA
GTTGCTCTTG

6361 CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA ACTTTAAAAG
TGCTCATCAT

6421 TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA CCGCTGTTGA
GATCCAGTTC

6481 GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT TTTACTTTCA
CCAGCGTTTC

6541 TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG GGAATAAGGG
CGACACGGAA

6601 ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA AGCATTTATC
AGGGTTATTG

6661 TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT AAACAAATAG
GGGTTCCGCG

6721 CACATTTCCC CGAAAAGTGC CACCTGAC

//

(SEQ ID NO: 3)
LOCUS                    6706 BP DS-DNA   CIRCULAR   SYN   23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV promoter/enhancer 1"
     misc_recomb     3506..3539
                     /note="LoxP"
     gene            5713..6573
                     /note="AmpR"
     rep_origin      4895..5568
                     /note="pUC"
     misc_recomb     2710..2745
                     /note="LoxP"
     LTR             835..1509
                     /note="5' HIV R-U5-del gag (HIV NL4-3/454-1126)"
     misc_feature    1539..2396
                     /note="HIV RRE (HIV NL4-3/7622-8459)"
     misc_feature    2422..2599
                     /note="HIV Flap"
     misc_feature    3594..4183
                     /note="WRE element"
     LTR             4203..4892
                     /note="3' SIN LTR"
     gene            2772..3452
                     /note="dsRed2"
BASE COUNT     1755 A   1638 C   1722 G   1591 T      0 OTHER
ORIGIN

1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA 
ATCTGCTCTG 

61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC 
GCTGAGTAGT 

121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC 
ATGAAGAATC 

181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT 
ACGCGTTGAC 

241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT 
CATAGCCCAT 

301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA 
CCGCCCAACG 

361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA 
ATAGGGACTT 

421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA 
GTACATCAAG 

481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG 
CCCGCCTGGC 

541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC 
TACGTATTAG 

601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT 
GGATAGCGGT 

661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT 
TTGTTTTGGC 
721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG
ACGCAAATGG

781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG
TACTGGGTCT

841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA
CCCACTGCTT

901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT
GTTGTGTGAC

961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC
TAGCAGTGGC

1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA
CGCAGGACTC

1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT
ACGCCAAAAA

1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT
ATTAAGCGGG

1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG
AAAAAATATA

1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT
AATCCTGGCC

1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA
TCCCTTCAGA

1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT
TGTGTGCATC

1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA
GAGCAAAACA

1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC
TGGAGGAGGA

1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA
AATTGAACCA

1621 TTAGGAGTAG CACCCACCAA GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA
AAGAGCAGTG

1681 GGAATAGGAG CTTTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT
GGGCGCAGCG

1741 TCAATGACGC TGACGGTACA GGCCAGACAA TTATTGTCTG GTATAGTGCA
GCAGCAGAAC

1801 AATTTGCTGA GGGCTATTGA GGCGCAACAG CATCTGTTGC AACTCACAGT
CTGGGGCATC

1861 AAGCAGCTCC AGGCAAGAAT CCTGGCTGTG GAAAGATACC TAAAGGATCA
ACAGCTCCTG

1921 GGGATTTGGG GTTGCTCTGG AAAACTCATT TGCACCACTG CTGTGCCTTG
GAATGCTAGT

1981 TGGAGTAATA AATCTCTGGA ACAGATTTGG AATCACACGA CCTGGATGGA
GTGGGACAGA

2041 GAAATTAACA ATTACACAAG CTTAATACAC TCCTTAATTG AAGAATCGCA
AAACCAGCAA

2101 GAAAAGAATG AACAAGAATT ATTGGAATTA GATAAATGGG CAAGTTTGTG
GAATTGGTTT

2161 AACATAACAA ATTGGCTGTG GTATATAAAA TTATTCATAA TGATAGTAGG
AGGCTTGGTA

2221 GGTTTAAGAA TAGTTTTTGC TGTACTTTCT ATAGTGAATA GAGTTAGGCA
GGGATATTCA

2281 CCATTATCGT TTCAGACCCA CCTCCCAACC CCGAGGGGAC CCGACAGGCC
CGAAGGAATA

2341 GAAGAAGAAG GTGGAGAGAG AGACAGAGAC AGATCCATTC GATTAGTGAA
CGGATCGGCA

2401 CTGCGTGCGC CAATTCTGCA GACAAATGGC AGTATTCATC CACAATTTTA
AAAGAAAAGG
     2461 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA CAGACATACA
     2521 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT ATTACAGGGA
     2581 CAGCAGAGAT CCAGTTTGGT TAGTACCGGG CCCGCTCTAG ACGGTTAACG CGCTAGCCGT
     2641 TAATTAAGCC TCGAGGTCGA CGGTATCGAT AAGCTCGCTT CACGAGATTC CAGCAGGTCG
     2701 AGGGACCTAA TAACTTCGTA TAGCATACAT TATACGAAGT TATATTAAGG GTTCCAAGCT
     2761 TAAGCGGCCG CGCCACCATG GCCTCCTCCG AGAACGTCAT CACCGAGTTC ATGCGCTTCA
     2821 AGGTGCGCAT GGAGGGCACC GTGAACGGCC ACGAGTTCGA GATCGAGGGC GAGGGCGAGG
     2881 GCCGCCCCTA CGAGGGCCAC AACACCGTGA AGCTGAAGGT GACCAAGGGC GGCCCCCTGC
     2941 CCTTCGCCTG GGACATCCTG TCCCCCCAGT TCCAGTACGG CTCCAAGGTG TACGTGAAGC
     3001 ACCCCGCCGA CATCCCCGAC TACAAGAAGC TGTCCTTCCC CGAGGGCTTC AAGTGGGAGC
     3061 GCGTGATGAA CTTCGAGGAC GGCGGCGTGG CGACCGTGAC CCAGGACTCC TCCCTGCAGG
     3121 ACGGCTGCTT CATCTACAAG GTGAAGTTCA TCGGCGTGAA CTTCCCCTCC GACGGCCCCG
     3181 TGATGCAGAA GAAGACCATG GGCTGGGAGG CCTCCACCGA GCGCCTGTAC CCCCGCGACG
     3241 GCGTGCTGAA GGGCGAGACC CACAAGGCCC TGAAGCTGAA GGACGGCGGC CACTACCTGG
     3301 TGGAGTTCAA GTCCATCTAC ATGGCCAAGA AGCCCGTGCA GCTGCCCGGC TACTACTACG
     3361 TGGACGCCAA GCTGGACATC ACCTCCCACA ACGAGGACTA CACCATCGTG GAGCAGTACG
     3421 AGCGCACCGA GGGCCGCCAC CACCTGTTCC TGATGCATGC CCCGGGATGG CGCGCCATGG
     3481 ATCCGCGAAT TCGTCGAGGG ACCTAATAAC TTCGTATAGC ATACATTATA CGAAGTTATA
     3541 CATGTTTAAG GGTTCCGGTT CCACTAGGTA CAATTCGATA TCAAGCTTAT CGATAATCAA
     3601 CCTCTGGATT ACAAAATTTG TGAAAGATTG ACTGGTATTC TTAACTATGT TGCTCCTTTT
     3661 ACGCTATGTG GATACGCTGC TTTAATGCCT TTGTATCATG CTATTGCTTC CCGTATGGCT
     3721 TTCATTTTCT CCTCCTTGTA TAAATCCTGG TTGCTGTCTC TTTATGAGGA GTTGTGGCCC
     3781 GTTGTCAGGC AACGTGGCGT GGTGTGCACT GTGTTTGCTG ACGCAACCCC CACTGGTTGG
     3841 GGCATTGCCA CCACCTGTCA GCTCCTTTCC GGGACTTTCG CTTTCCCCCT CCCTATTGCC
     3901 ACGGCGGAAC TCATCGCCGC CTGCCTTGCC CGCTGCTGGA CAGGGGCTCG GCTGTTGGGC
     3961 ACTGACAATT CCGTGGTGTT GTCGGGGAAA TCATCGTCCT TTCCTTGGCT GCTCGCCTGT
     4021 GTTGCCACCT GGATTCTGCG CGGGACGTCC TTCTGCTACG TCCCTTCGGC CCTCAATCCA
     4081 GCGGACCTTC CTTCCCGCGG CCTGCTGCCG GCTCTGCGGC CTCTTCCGCG TCTTCGCCTT
     4141 CGCCCTCAGA CGAGTCGGAT CTCCCTTTGG GCCGCCTCCC CGCATCGATA CCGTCGACCT
     4201 CGATCGAGAC CTAGAAAAAC ATGGAGCAAT CACAAGTAGC AATACAGCAG CTACCAATGC
     4261 TGATTGTGCC TGGCTAGAAG CACAAGAGGA GGAGGAGGTG GGTTTTCCAG TCACACCTCA
     4321 GGTACCTTTA AGACCAATGA CTTACAAGGC AGCTGTAGAT CTTAGCCACT TTTTAAAAGA
     4381 AAAGGGGGGA CTGGAAGGGC TAATTCACTC CCAACGAAGA CAAGATATCC TTGATCTGTG
     4441 GATCTACCAC ACACAAGGCT ACTTCCCTGA TTGGCAGAAC TACACACCAG GGCCAGGGAT
     4501 CAGATATCCA CTGACCTTTG GATGGTGCTA CAAGCTAGTA CCAGTTGAGC AAGAGAAGGT
     4561 AGAAGAAGCC AATGAAGGAG AGAACACCCG CTTGTTACAC CCTGTGAGCC TGCATGGGAT
     4621 GGATGACCCG GAGAGAGAAG TATTAGAGTG GAGGTTTGAC AGCCGCCTAG CATTTCATCA
     4681 CATGGCCCGA GAGCTGCATC CGGACTGTAC TGGGTCTCTC TGGTTAGACC AGATCTGAGC
     4741 CTGGGAGCTC TCTGGCTAAC TAGGGAACCC ACTGCTTAAG CCTCAATAAA GCTTGCCTTG
     4801 AGTGCTTCAA GTAGTGTGTG CCCGTCTGTT GTGTGACTCT GGTAACTAGA GATCCCTCAG
     4861 ACCCTTTTAG TCAGTGTGGA AAATCTCTAG CAGCATGTGA GCAAAAGGCC AGCAAAAGGC
     4921 CAGGAACCGT AAAAAGGCCG CGTTGCTGGC GTTTTTCCAT AGGCTCCGCC CCCCTGACGA
     4981 GCATCACAAA AATCGACGCT CAAGTCAGAG GTGGCGAAAC CCGACAGGAC TATAAAGATA
     5041 CCAGGCGTTT CCCCCTGGAA GCTCCCTCGT GCGCTCTCCT GTTCCGACCC TGCCGCTTAC
     5101 CGGATACCTG TCCGCCTTTC TCCCTTCGGG AAGCGTGGCG CTTTCTCATA GCTCACGCTG
     5161 TAGGTATCTC AGTTCGGTGT AGGTCGTTCG CTCCAAGCTG GGCTGTGTGC ACGAACCCCC
     5221 CGTTCAGCCC GACCGCTGCG CCTTATCCGG TAACTATCGT CTTGAGTCCA ACCCGGTAAG
     5281 ACACGACTTA TCGCCACTGG CAGCAGCCAC TGGTAACAGG ATTAGCAGAG CGAGGTATGT
     5341 AGGCGGTGCT ACAGAGTTCT TGAAGTGGTG GCCTAACTAC GGCTACACTA GAAGAACAGT
     5401 ATTTGGTATC TGCGCTCTGC TGAAGCCAGT TACCTTCGGA AAAAGAGTTG GTAGCTCTTG
     5461 ATCCGGCAAA CAAACCACCG CTGGTAGCGG TGGTTTTTTT GTTTGCAAGC AGCAGATTAC
     5521 GCGCAGAAAA AAAGGATCTC AAGAAGATCC TTTGATCTTT TCTACGGGGT CTGACGCTCA
     5581 GTGGAACGAA AACTCACGTT AAGGGATTTT GGTCATGAGA TTATCAAAAA GGATCTTCAC
     5641 CTAGATCCTT TTAAATTAAA AATGAAGTTT TAAATCAATC TAAAGTATAT ATGAGTAAAC
     5701 TTGGTCTGAC AGTTACCAAT GCTTAATCAG TGAGGCACCT ATCTCAGCGA TCTGTCTATT
     5761 TCGTTCATCC ATAGTTGCCT GACTCCCCGT CGTGTAGATA ACTACGATAC GGGAGGGCTT
     5821 ACCATCTGGC CCCAGTGCTG CAATGATACC GCGAGACCCA CGCTCACCGG CTCCAGATTT
     5881 ATCAGCAATA AACCAGCCAG CCGGAAGGGC CGAGCGCAGA AGTGGTCCTG CAACTTTATC
     5941 CGCCTCCATC CAGTCTATTA ATTGTTGCCG GGAAGCTAGA GTAAGTAGTT CGCCAGTTAA
     6001 TAGTTTGCGC AACGTTGTTG CCATTGCTAC AGGCATCGTG GTGTCACGCT CGTCGTTTGG
     6061 TATGGCTTCA TTCAGCTCCG GTTCCCAACG ATCAAGGCGA GTTACATGAT CCCCCATGTT
     6121 GTGCAAAAAA GCGGTTAGCT CCTTCGGTCC TCCGATCGTT GTCAGAAGTA AGTTGGCCGC
     6181 AGTGTTATCA CTCATGGTTA TGGCAGCACT GCATAATTCT CTTACTGTCA TGCCATCCGT
     6241 AAGATGCTTT TCTGTGACTG GTGAGTACTC AACCAAGTCA TTCTGAGAAT AGTGTATGCG
     6301 GCGACCGAGT TGCTCTTGCC CGGCGTCAAT ACGGGATAAT ACCGCGCCAC ATAGCAGAAC
     6361 TTTAAAAGTG CTCATCATTG GAAAACGTTC TTCGGGGCGA AAACTCTCAA GGATCTTACC
     6421 GCTGTTGAGA TCCAGTTCGA TGTAACCCAC TCGTGCACCC AACTGATCTT CAGCATCTTT
     6481 TACTTTCACC AGCGTTTCTG GGTGAGCAAA AACAGGAAGG CAAAATGCCG CAAAAAAGGG
     6541 AATAAGGGCG ACACGGAAAT GTTGAATACT CATACTCTTC CTTTTTCAAT ATTATTGAAG
     6601 CATTTATCAG GGTTATTGTC TCATGAGCGG ATACATATTT GAATGTATTT AGAAAAATAA
     6661 ACAAATAGGG GTTCCGCGCA CATTTCCCCG AAAAGTGCCA CCTGAC

//

(SEQ ID NO: 4)
[00348] pLL3.3

LOCUS       PLENTILOX    7248 BP DS-DNA   CIRCULAR  SYN       23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV promoter/enhancer 1"
     gene            6255..7115
                     /note="AmpR"
     rep_origin      5437..6110
                     /note="pUC"
     misc_recomb     3931..3966
                     /note="Lox 1"
     misc_recomb     4048..4081
                     /note="Lox2"
     LTR             835..1509
                     /note="5' HIV R-U5-del gag (HIV NL4-3/454-1126)"
     misc_feature    1539..2396
                     /note="HIV RRE (HIV NL4-3/7622-8459)"
     misc_feature    2422..2599
                     /note="HIV Flap"
     misc_feature    4136..4725
                     /note="WRE element"
     LTR             4745..5434
                     /note="3' SIN LTR"
     frag            2627..3847
                     /note="13 to 1233 of pUB6/V5-HisA"
     promoter        2632..3841
                     /note="UbC promoter"
BASE COUNT     1815 A   1695 C   1947 G   1791 T      0 OTHER
ORIGIN

        1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA ATCTGCTCTG
       61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC GCTGAGTAGT
      121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC ATGAAGAATC
      181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT ACGCGTTGAC
      241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT CATAGCCCAT
      301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA CCGCCCAACG
      361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA ATAGGGACTT
      421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA GTACATCAAG
      481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG CCCGCCTGGC
      541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC TACGTATTAG
      601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT GGATAGCGGT
      661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT TTGTTTTGGC
      721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG ACGCAAATGG
      781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG TACTGGGTCT
      841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA CCCACTGCTT
      901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT GTTGTGTGAC
      961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC TAGCAGTGGC
     1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA CGCAGGACTC
     1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT ACGCCAAAAA
     1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT ATTAAGCGGG
     1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG AAAAAATATA
     1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT AATCCTGGCC
     1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA TCCCTTCAGA
     1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT TGTGTGCATC
     1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA GAGCAAAACA
     1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC TGGAGGAGGA
     1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA AATTGAACCA
     1621 TTAGGAGTAG CACCCACCAA GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA AAGAGCAGTG
     1681 GGAATAGGAG CTTTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT GGGCGCAGCG
     1741 TCAATGACGC TGACGGTACA GGCCAGACAA TTATTGTCTG GTATAGTGCA GCAGCAGAAC
     1801 AATTTGCTGA GGGCTATTGA GGCGCAACAG CATCTGTTGC AACTCACAGT CTGGGGCATC
     1861 AAGCAGCTCC AGGCAAGAAT CCTGGCTGTG GAAAGATACC TAAAGGATCA ACAGCTCCTG
     1921 GGGATTTGGG GTTGCTCTGG AAAACTCATT TGCACCACTG CTGTGCCTTG GAATGCTAGT
     1981 TGGAGTAATA AATCTCTGGA ACAGATTTGG AATCACACGA CCTGGATGGA GTGGGACAGA
     2041 GAAATTAACA ATTACACAAG CTTAATACAC TCCTTAATTG AAGAATCGCA AAACCAGCAA
     2101 GAAAAGAATG AACAAGAATT ATTGGAATTA GATAAATGGG CAAGTTTGTG GAATTGGTTT
     2161 AACATAACAA ATTGGCTGTG GTATATAAAA TTATTCATAA TGATAGTAGG AGGCTTGGTA
     2221 GGTTTAAGAA TAGTTTTTGC TGTACTTTCT ATAGTGAATA GAGTTAGGCA GGGATATTCA
     2281 CCATTATCGT TTCAGACCCA CCTCCCAACC CCGAGGGGAC CCGACAGGCC CGAAGGAATA
     2341 GAAGAAGAAG GTGGAGAGAG AGACAGAGAC AGATCCATTC GATTAGTGAA CGGATCGGCA
     2401 CTGCGTGCGC CAATTCTGCA GACAAATGGC AGTATTCATC CACAATTTTA AAAGAAAAGG
     2461 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA CAGACATACA
     2521 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT ATTACAGGGA
     2581 CAGCAGAGAT CCAGTTTGGT TAGTACCGGG CCCGCTCTAG ACGGTTGATC TGGCCTCCGC
     2641 GCCGGGTTTT GGCGCCTCCC GCGGGCGCCC CCCTCCTCAC GGCGAGCGCT GCCACGTCAG
     2701 ACGAAGGGCG CAGGAGCGTC CTGATCCTTC CGCCCGGACG CTCAGGACAG CGGCCCGCTG
     2761 CTCATAAGAC TCGGCCTTAG AACCCCAGTA TCAGCAGAAG GACATTTTAG GACGGGACTT
     2821 GGGTGACTCT AGGGCAGTGG TTTTCTTTCC AGAGAGCGGA ACAGGCGAGG AAAAGTAGTC
     2881 CCTTCTCGGC GATTCTGCGG AGGGATCTCC GTGGGGCGGT GAACGCCGAT GATTATATAA
     2941 GGACGCGCCG GGTGTGGCAC AGCTAGTTCC GTCGCAGCCG GGATTTGGGT CGCGGTTCTT
     3001 GTTTGTGGAT CGCTGTGATC GTCACTTGGT GAGTAGCGGG CTGCTGGGCT GGCCGGGGCT
     3061 TTCGTGGCCG CCGGGCCGCT CGGTGGGACG GAAGCGTGTG GAGAGACCGC CAAGGGCTGT
     3121 AGTCTGGGTC CGCGAGCAAG GTTGCCCTGA ACTGGGGGTT GGGGGGAGCG CAGCAAAATG
     3181 GCGGCTGTTC CCGAGTCTTG AATGGAAGAC GCTTGTGAGG CGGGCTGTGA GGTCGTTGAA
     3241 ACAAGGTGGG GGGCATGGTG GGCGGCAAGA ACCCAAGGTC TTGAGGCCTT CGCTAATGCG
     3301 GGAAAGCTCT TATTCGGGTG AGATGGGCTG GGGCACCATC TGGGGACCCT GACGTGAAGT
     3361 TTGTCACTGA CTGGAGAACT CGGTTTGTCG TCTGTTGCGG GGGCGGCAGT TATGCGGTGC
     3421 CGTTGGGCAG TGCACCCGTA CCTTTGGGAG CGCGCGCCCT CGTCGTGTCG TGACGTCACC
     3481 CGTTCTGTTG GCTTATAATG CAGGGTGGGG CCACCTGCCG GTAGGTGTGC GGTAGGCTTT
     3541 TCTCCGTCGC AGGACGCAGG GTTCGGGCCT AGGGTAGGCT CTCCTGAATC GACAGGCGCC
     3601 GGACCTCTGG TGAGGGGAGG GATAAGTGAG GCGTCAGTTT CTTTGGTCGG TTTTATGTAC
     3661 CTATCTTCTT AAGTAGCTGA AGCTCCGGTT TTGAACTATG CGCTCGGGGT TGGCGAGTGT
     3721 GTTTTGTGAA GTTTTTTAGG CACCTTTTGA AATGTAATCA TTTGGGTCAA TATGTAATTT
     3781 TCAGTGTTAG ACTAGTAAAT TGTCCGCTAA ATTCTGGCCG TTTTTGGCTT TTTTGTTAGA
     3841 CGAAGCTAAC GCGCTAGCCG TTAATTAAGC CTCGAGGTCG ACGGTATCGA TAAGCTCGCT
     3901 TCACGAGATT CCAGCAGGTC GAGGGACCTA ATAACTTCGT ATAGCATACA TTATACGAAG
     3961 TTATATTAAG GGTTCCAAGC TTAAGCGGCC GCCGATGCAT GCCCCGGGAT GGCGCGCCAT
     4021 GGATCCGCGA ATTCGTCGAG GGACCTAATA ACTTCGTATA GCATACATTA TACGAAGTTA
     4081 TACATGTTTA AGGGTTCCGG TTCCACTAGG TACAATTCGA TATCAAGCTT ATCGATAATC
     4141 AACCTCTGGA TTACAAAATT TGTGAAAGAT TGACTGGTAT TCTTAACTAT GTTGCTCCTT
     4201 TTACGCTATG TGGATACGCT GCTTTAATGC CTTTGTATCA TGCTATTGCT TCCCGTATGG
     4261 CTTTCATTTT CTCCTCCTTG TATAAATCCT GGTTGCTGTC TCTTTATGAG GAGTTGTGGC
     4321 CCGTTGTCAG GCAACGTGGC GTGGTGTGCA CTGTGTTTGC TGACGCAACC CCCACTGGTT
     4381 GGGGCATTGC CACCACCTGT CAGCTCCTTT CCGGGACTTT CGCTTTCCCC CTCCCTATTG
     4441 CCACGGCGGA ACTCATCGCC GCCTGCCTTG CCCGCTGCTG GACAGGGGCT CGGCTGTTGG
     4501 GCACTGACAA TTCCGTGGTG TTGTCGGGGA AATCATCGTC CTTTCCTTGG CTGCTCGCCT
     4561 GTGTTGCCAC CTGGATTCTG CGCGGGACGT CCTTCTGCTA CGTCCCTTCG GCCCTCAATC
     4621 CAGCGGACCT TCCTTCCCGC GGCCTGCTGC CGGCTCTGCG GCCTCTTCCG CGTCTTCGCC
     4681 TTCGCCCTCA GACGAGTCGG ATCTCCCTTT GGGCCGCCTC CCCGCATCGA TACCGTCGAC
     4741 CTCGATCGAG ACCTAGAAAA ACATGGAGCA ATCACAAGTA GCAATACAGC AGCTACCAAT
     4801 GCTGATTGTG CCTGGCTAGA AGCACAAGAG GAGGAGGAGG TGGGTTTTCC AGTCACACCT
     4861 CAGGTACCTT TAAGACCAAT GACTTACAAG GCAGCTGTAG ATCTTAGCCA CTTTTTAAAA
     4921 GAAAAGGGGG GACTGGAAGG GCTAATTCAC TCCCAACGAA GACAAGATAT CCTTGATCTG
     4981 TGGATCTACC ACACACAAGG CTACTTCCCT GATTGGCAGA ACTACACACC AGGGCCAGGG
     5041 ATCAGATATC CACTGACCTT TGGATGGTGC TACAAGCTAG TACCAGTTGA GCAAGAGAAG
     5101 GTAGAAGAAG CCAATGAAGG AGAGAACACC CGCTTGTTAC ACCCTGTGAG CCTGCATGGG
     5161 ATGGATGACC CGGAGAGAGA AGTATTAGAG TGGAGGTTTG ACAGCCGCCT AGCATTTCAT
     5221 CACATGGCCC GAGAGCTGCA TCCGGACTGT ACTGGGTCTC TCTGGTTAGA CCAGATCTGA
     5281 GCCTGGGAGC TCTCTGGCTA ACTAGGGAAC CCACTGCTTA AGCCTCAATA AAGCTTGCCT
     5341 TGAGTGCTTC AAGTAGTGTG TGCCCGTCTG TTGTGTGACT CTGGTAACTA GAGATCCCTC
     5401 AGACCCTTTT AGTCAGTGTG GAAAATCTCT AGCAGCATGT GAGCAAAAGG CCAGCAAAAG
     5461 GCCAGGAACC GTAAAAAGGC CGCGTTGCTG GCGTTTTTCC ATAGGCTCCG CCCCCCTGAC
     5521 GAGCATCACA AAAATCGACG CTCAAGTCAG AGGTGGCGAA ACCCGACAGG ACTATAAAGA
     5581 TACCAGGCGT TTCCCCCTGG AAGCTCCCTC GTGCGCTCTC CTGTTCCGAC CCTGCCGCTT
     5641 ACCGGATACC TGTCCGCCTT TCTCCCTTCG GGAAGCGTGG CGCTTTCTCA TAGCTCACGC
     5701 TGTAGGTATC TCAGTTCGGT GTAGGTCGTT CGCTCCAAGC TGGGCTGTGT GCACGAACCC
     5761 CCCGTTCAGC CCGACCGCTG CGCCTTATCC GGTAACTATC GTCTTGAGTC CAACCCGGTA
     5821 AGACACGACT TATCGCCACT GGCAGCAGCC ACTGGTAACA GGATTAGCAG AGCGAGGTAT
     5881 GTAGGCGGTG CTACAGAGTT CTTGAAGTGG TGGCCTAACT ACGGCTACAC TAGAAGAACA
     5941 GTATTTGGTA TCTGCGCTCT GCTGAAGCCA GTTACCTTCG GAAAAAGAGT TGGTAGCTCT
     6001 TGATCCGGCA AACAAACCAC CGCTGGTAGC GGTGGTTTTT TTGTTTGCAA GCAGCAGATT
     6061 ACGCGCAGAA AAAAAGGATC TCAAGAAGAT CCTTTGATCT TTTCTACGGG GTCTGACGCT
     6121 CAGTGGAACG AAAACTCACG TTAAGGGATT TTGGTCATGA GATTATCAAA AAGGATCTTC
     6181 ACCTAGATCC TTTTAAATTA AAAATGAAGT TTTAAATCAA TCTAAAGTAT ATATGAGTAA
     6241 ACTTGGTCTG ACAGTTACCA ATGCTTAATC AGTGAGGCAC CTATCTCAGC GATCTGTCTA
     6301 TTTCGTTCAT CCATAGTTGC CTGACTCCCC GTCGTGTAGA TAACTACGAT ACGGGAGGGC
     6361 TTACCATCTG GCCCCAGTGC TGCAATGATA CCGCGAGACC CACGCTCACC GGCTCCAGAT
     6421 TTATCAGCAA TAAACCAGCC AGCCGGAAGG GCCGAGCGCA GAAGTGGTCC TGCAACTTTA
     6481 TCCGCCTCCA TCCAGTCTAT TAATTGTTGC CGGGAAGCTA GAGTAAGTAG TTCGCCAGTT
     6541 AATAGTTTGC GCAACGTTGT TGCCATTGCT ACAGGCATCG TGGTGTCACG CTCGTCGTTT
     6601 GGTATGGCTT CATTCAGCTC CGGTTCCCAA CGATCAAGGC GAGTTACATG ATCCCCCATG
     6661 TTGTGCAAAA AAGCGGTTAG CTCCTTCGGT CCTCCGATCG TTGTCAGAAG TAAGTTGGCC
     6721 GCAGTGTTAT CACTCATGGT TATGGCAGCA CTGCATAATT CTCTTACTGT CATGCCATCC
     6781 GTAAGATGCT TTTCTGTGAC TGGTGAGTAC TCAACCAAGT CATTCTGAGA ATAGTGTATG
     6841 CGGCGACCGA GTTGCTCTTG CCCGGCGTCA ATACGGGATA ATACCGCGCC ACATAGCAGA
     6901 ACTTTAAAAG TGCTCATCAT TGGAAAACGT TCTTCGGGGC GAAAACTCTC AAGGATCTTA
     6961 CCGCTGTTGA GATCCAGTTC GATGTAACCC ACTCGTGCAC CCAACTGATC TTCAGCATCT
     7021 TTTACTTTCA CCAGCGTTTC TGGGTGAGCA AAAACAGGAA GGCAAAATGC CGCAAAAAAG
     7081 GGAATAAGGG CGACACGGAA ATGTTGAATA CTCATACTCT TCCTTTTTCA ATATTATTGA
     7141 AGCATTTATC AGGGTTATTG TCTCATGAGC GGATACATAT TTGAATGTAT TTAGAAAAAT
     7201 AAACAAATAG GGGTTCCGCG CACATTTCCC CGAAAAGTGC CACCTGAC

// 

(SEQ ID NO: 5) 
[00349] pLL3.4

LOCUS       PLENTILOX    7969 BP DS-DNA   CIRCULAR  SYN       23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV promoter/enhancer 1"
     misc_recomb     4769..4802
                     /note="LoxP"
     promoter        2632..3841
                     /note="UbC promoter"
     gene            6976..7836
                     /note="AmpR"
     rep_origin      6158..6831
                     /note="pUC"
     misc_recomb     3931..3966
                     /note="LoxP"
     LTR             835..1509
                     /note="5' HIV R-U5-del gag (HIV NL4-3/454-1126)"
     misc_feature    1539..2396
                     /note="HIV RRE (HIV NL4-3/7622-8459)"
     misc_feature    2422..2599
                     /note="HIV Flap"
     misc_feature    4857..5446
                     /note="WRE element"
     LTR             5466..6155
                     /note="3' SIN LTR"
     gene            3993..4715
                     /note="EGFP"
BASE COUNT     1988 A   1938 C   2150 G   1893 T      0 OTHER
ORIGIN

        1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA ATCTGCTCTG
       61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC GCTGAGTAGT
      121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC ATGAAGAATC
      181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT ACGCGTTGAC
      241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT CATAGCCCAT
      301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA CCGCCCAACG
      361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA ATAGGGACTT
      421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA GTACATCAAG
      481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG CCCGCCTGGC
      541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC TACGTATTAG
      601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT GGATAGCGGT
      661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT TTGTTTTGGC
      721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG ACGCAAATGG
      781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG TACTGGGTCT
      841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA CCCACTGCTT
      901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT GTTGTGTGAC
      961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC TAGCAGTGGC
     1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA CGCAGGACTC
     1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT ACGCCAAAAA
     1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT ATTAAGCGGG
     1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG AAAAAATATA
     1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT AATCCTGGCC
     1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA TCCCTTCAGA
     1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT TGTGTGCATC
     1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA GAGCAAAACA
     1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC TGGAGGAGGA
     1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA AATTGAACCA
     1621 TTAGGAGTAG CACCCACCAA GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA AAGAGCAGTG
     1681 GGAATAGGAG CTTTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT GGGCGCAGCG
     1741 TCAATGACGC TGACGGTACA GGCCAGACAA TTATTGTCTG GTATAGTGCA GCAGCAGAAC
     1801 AATTTGCTGA GGGCTATTGA GGCGCAACAG CATCTGTTGC AACTCACAGT CTGGGGCATC
     1861 AAGCAGCTCC AGGCAAGAAT CCTGGCTGTG GAAAGATACC TAAAGGATCA ACAGCTCCTG
     1921 GGGATTTGGG GTTGCTCTGG AAAACTCATT TGCACCACTG CTGTGCCTTG GAATGCTAGT
     1981 TGGAGTAATA AATCTCTGGA ACAGATTTGG AATCACACGA CCTGGATGGA GTGGGACAGA
     2041 GAAATTAACA ATTACACAAG CTTAATACAC TCCTTAATTG AAGAATCGCA AAACCAGCAA
     2101 GAAAAGAATG AACAAGAATT ATTGGAATTA GATAAATGGG CAAGTTTGTG GAATTGGTTT
     2161 AACATAACAA ATTGGCTGTG GTATATAAAA TTATTCATAA TGATAGTAGG AGGCTTGGTA
     2221 GGTTTAAGAA TAGTTTTTGC TGTACTTTCT ATAGTGAATA GAGTTAGGCA GGGATATTCA
     2281 CCATTATCGT TTCAGACCCA CCTCCCAACC CCGAGGGGAC CCGACAGGCC CGAAGGAATA
     2341 GAAGAAGAAG GTGGAGAGAG AGACAGAGAC AGATCCATTC GATTAGTGAA CGGATCGGCA
     2401 CTGCGTGCGC CAATTCTGCA GACAAATGGC AGTATTCATC CACAATTTTA AAAGAAAAGG
     2461 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA CAGACATACA
     2521 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT ATTACAGGGA
     2581 CAGCAGAGAT CCAGTTTGGT TAGTACCGGG CCCGCTCTAG ACGGTTGATC TGGCCTCCGC
     2641 GCCGGGTTTT GGCGCCTCCC GCGGGCGCCC CCCTCCTCAC GGCGAGCGCT GCCACGTCAG
     2701 ACGAAGGGCG CAGGAGCGTC CTGATCCTTC CGCCCGGACG CTCAGGACAG CGGCCCGCTG
     2761 CTCATAAGAC TCGGCCTTAG AACCCCAGTA TCAGCAGAAG GACATTTTAG GACGGGACTT
     2821 GGGTGACTCT AGGGCACTGG TTTTCTTTCC AGAGAGCGGA ACAGGCGAGG AAAAGTAGTC
     2881 CCTTCTCGGC GATTCTGCGG AGGGATCTCC GTGGGGCGGT GAACGCCGAT GATTATATAA
     2941 GGACGCGCCG GGTGTGGCAC AGCTAGTTCC GTCGCAGCCG GGATTTGGGT CGCGGTTCTT
     3001 GTTTGTGGAT CGCTGTGATC GTCACTTGGT GAGTAGCGGG CTGCTGGGCT GGCCGGGGCT
     3061 TTCGTGGCCG CCGGGCCGCT CGGTGGGACG GAAGCGTGTG GAGAGACCGC CAAGGGCTGT
     3121 AGTCTGGGTC CGCGAGCAAG GTTGCCCTGA ACTGGGGGTT GGGGGGAGCG CAGCAAAATG
     3181 GCGGCTGTTC CCGAGTCTTG AATGGAAGAC GCTTGTGAGG CGGGCTGTGA GGTCGTTGAA
     3241 ACAAGGTGGG GGGCATGGTG GGCGGCAAGA ACCCAAGGTC TTGAGGCCTT CGCTAATGCG
     3301 GGAAAGCTCT TATTCGGGTG AGATGGGCTG GGGCACCATC TGGGGACCCT GACGTGAAGT
     3361 TTGTCACTGA CTGGAGAACT CGGTTTGTCG TCTGTTGCGG GGGCGGCAGT TATGCGGTGC
     3421 CGTTGGGCAG TGCACCCGTA CCTTTGGGAG CGCGCGCCCT CGTCGTGTCG TGACGTCACC
     3481 CGTTCTGTTG GCTTATAATG CAGGGTGGGG CCACCTGCCG GTAGGTGTGC GGTAGGCTTT
     3541 TCTCCGTCGC AGGACGCAGG GTTCGGGCCT AGGGTAGGCT CTCCTGAATC GACAGGCGCC
     3601 GGACCTCTGG TGAGGGGAGG GATAAGTGAG GCGTCAGTTT CTTTGGTCGG TTTTATGTAC
     3661 CTATCTTCTT AAGTAGCTGA AGCTCCGGTT TTGAACTATG CGCTCGGGGT TGGCGAGTGT
     3721 GTTTTGTGAA GTTTTTTAGG CACCTTTTGA AATGTAATCA TTTGGGTCAA TATGTAATTT
     3781 TCAGTGTTAG ACTAGTAAAT TGTCCGCTAA ATTCTGGCCG TTTTTGGCTT TTTTGTTAGA
     3841 CGAAGCTAAC GCGCTAGCCG TTAATTAAGC CTCGAGGTCG ACGGTATCGA TAAGCTCGCT
     3901 TCACGAGATT CCAGCAGGTC GAGGGACCTA ATAACTTCGT ATAGCATACA TTATACGAAG
     3961 TTATATTAAG GGTTCCAAGC TTAAGCGGCC GCGCCACCAT GGTGAGCAAG GGCGAGGAGC
     4021 TGTTCACCGG GGTGGTGCCC ATCCTGGTCG AGCTGGACGG CGACGTAAAC GGCCACAAGT
     4081 TCAGCGTGTC CGGCGAGGGC GAGGGCGATG CCACCTACGG CAAGCTGACC CTGAAGTTCA
     4141 TCTGCACCAC CGGCAAGCTG CCCGTGCCCT GGCCCACCCT CGTGACCACC CTGACCTACG
     4201 GCGTGCAGTG CTTCAGCCGC TACCCCGACC ACATGAAGCA GCACGACTTC TTCAAGTCCG
     4261 CCATGCCCGA AGGCTACGTC CAGGAGCGCA CCATCTTCTT CAAGGACGAC GGCAACTACA
     4321 AGACCCGCGC CGAGGTGAAG TTCGAGGGCG ACACCCTGGT GAACCGCATC GAGCTGAAGG
     4381 GCATCGACTT CAAGGAGGAC GGCAACATCC TGGGGCACAA GCTGGAGTAC AACTACAACA
     4441 GCCACAACGT CTATATCATG GCCGACAAGC AGAAGAACGG CATCAAGGTG AACTTCAAGA
     4501 TCCGCCACAA CATCGAGGAC GGCAGCGTGC AGCTCGCCGA CCACTACCAG CAGAACACCC
     4561 CCATCGGCGA CGGCCCCGTG CTGCTGCCCG ACAACCACTA CCTGAGCACC CAGTCCGCCC
     4621 TGAGCAAAGA CCCCAACGAG AAGCGCGATC ACATGGTCCT GCTGGAGTTC GTGACCGCCG
     4681 CCGGGATCAC TCTCGGCATG GACGAGCTGT ACAAGATGCA TGCCCCGGGA TGGCGCGCCA
     4741 TGGATCCGCG AATTCGTCGA GGGACCTAAT AACTTCGTAT AGCATACATT ATACGAAGTT
     4801 ATACATGTTT AAGGGTTCCG GTTCCACTAG GTACAATTCG ATATCAAGCT TATCGATAAT
     4861 CAACCTCTGG ATTACAAAAT TTGTGAAAGA TTGACTGGTA TTCTTAACTA TGTTGCTCCT
     4921 TTTACGCTAT GTGGATACGC TGCTTTAATG CCTTTGTATC ATGCTATTGC TTCCCGTATG
     4981 GCTTTCATTT TCTCCTCCTT GTATAAATCC TGGTTGCTGT CTCTTTATGA GGAGTTGTGG
     5041 CCCGTTGTCA GGCAACGTGG CGTGGTGTGC ACTGTGTTTG CTGACGCAAC CCCCACTGGT
     5101 TGGGGCATTG CCACCACCTG TCAGCTCCTT TCCGGGACTT TCGCTTTCCC CCTCCCTATT
     5161 GCCACGGCGG AACTCATCGC CGCCTGCCTT GCCCGCTGCT GGACAGGGGC TCGGCTGTTG
     5221 GGCACTGACA ATTCCGTGGT GTTGTCGGGG AAATCATCGT CCTTTCCTTG GCTGCTCGCC
     5281 TGTGTTGCCA CCTGGATTCT GCGCGGGACG TCCTTCTGCT ACGTCCCTTC GGCCCTCAAT
     5341 CCAGCGGACC TTCCTTCCCG CGGCCTGCTG CCGGCTCTGC GGCCTCTTCC GCGTCTTCGC
     5401 CTTCGCCCTC AGACGAGTCG GATCTCCCTT TGGGCCGCCT CCCCGCATCG ATACCGTCGA
     5461 CCTCGATCGA GACCTAGAAA AACATGGAGC AATCACAAGT AGCAATACAG CAGCTACCAA
     5521 TGCTGATTGT GCCTGGCTAG AAGCACAAGA GGAGGAGGAG GTGGGTTTTC CAGTCACACC
     5581 TCAGGTACCT TTAAGACCAA TGACTTACAA GGCAGCTGTA GATCTTAGCC ACTTTTTAAA
     5641 AGAAAAGGGG GGACTGGAAG GGCTAATTCA CTCCCAACGA AGACAAGATA TCCTTGATCT
     5701 GTGGATCTAC CACACACAAG GCTACTTCCC TGATTGGCAG AACTACACAC CAGGGCCAGG
     5761 GATCAGATAT CCACTGACCT TTGGATGGTG CTACAAGCTA GTACCAGTTG AGCAAGAGAA
     5821 GGTAGAAGAA GCCAATGAAG GAGAGAACAC CCGCTTGTTA CACCCTGTGA GCCTGCATGG
     5881 GATGGATGAC CCGGAGAGAG AAGTATTAGA GTGGAGGTTT GACAGCCGCC TAGCATTTCA
     5941 TCACATGGCC CGAGAGCTGC ATCCGGACTG TACTGGGTCT CTCTGGTTAG ACCAGATCTG
     6001 AGCCTGGGAG CTCTCTGGCT AACTAGGGAA CCCACTGCTT AAGCCTCAAT AAAGCTTGCC
     6061 TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT GTTGTGTGAC TCTGGTAACT AGAGATCCCT
     6121 CAGACCCTTT TAGTCAGTGT GGAAAATCTC TAGCAGCATG TGAGCAAAAG GCCAGCAAAA
     6181 GGCCAGGAAC CGTAAAAAGG CCGCGTTGCT GGCGTTTTTC CATAGGCTCC GCCCCCCTGA
     6241 CGAGCATCAC AAAAATCGAC GCTCAAGTCA GAGGTGGCGA AACCCGACAG GACTATAAAG
     6301 ATACCAGGCG TTTCCCCCTG GAAGCTCCCT CGTGCGCTCT CCTGTTCCGA CCCTGCCGCT
     6361 TACCGGATAC CTGTCCGCCT TTCTCCCTTC GGGAAGCGTG GCGCTTTCTC ATAGCTCACG
     6421 CTGTAGGTAT CTCAGTTCGG TGTAGGTCGT TCGCTCCAAG CTGGGCTGTG TGCACGAACC
     6481 CCCCGTTCAG CCCGACCGCT GCGCCTTATC CGGTAACTAT CGTCTTGAGT CCAACCCGGT
     6541 AAGACACGAC TTATCGCCAC TGGCAGCAGC CACTGGTAAC AGGATTAGCA GAGCGAGGTA
     6601 TGTAGGCGGT GCTACAGAGT TCTTGAAGTG GTGGCCTAAC TACGGCTACA CTAGAAGAAC
     6661 AGTATTTGGT ATCTGCGCTC TGCTGAAGCC AGTTACCTTC GGAAAAAGAG TTGGTAGCTC
     6721 TTGATCCGGC AAACAAACCA CCGCTGGTAG CGGTGGTTTT TTTGTTTGCA AGCAGCAGAT
     6781 TACGCGCAGA AAAAAAGGAT CTCAAGAAGA TCCTTTGATC TTTTCTACGG GGTCTGACGC
     6841 TCAGTGGAAC GAAAACTCAC GTTAAGGGAT TTTGGTCATG AGATTATCAA AAAGGATCTT
     6901 CACCTAGATC CTTTTAAATT AAAAATGAAG TTTTAAATCA ATCTAAAGTA TATATGAGTA
     6961 AACTTGGTCT GACAGTTACC AATGCTTAAT CAGTGAGGCA CCTATCTCAG CGATCTGTCT
     7021 ATTTCGTTCA TCCATAGTTG CCTGACTCCC CGTCGTGTAG ATAACTACGA TACGGGAGGG
     7081 CTTACCATCT GGCCCCAGTG CTGCAATGAT ACCGCGAGAC CCACGCTCAC CGGCTCCAGA
     7141 TTTATCAGCA ATAAACCAGC CAGCCGGAAG GGCCGAGCGC AGAAGTGGTC CTGCAACTTT
     7201 ATCCGCCTCC ATCCAGTCTA TTAATTGTTG CCGGGAAGCT AGAGTAAGTA GTTCGCCAGT
     7261 TAATAGTTTG CGCAACGTTG TTGCCATTGC TACAGGCATC GTGGTGTCAC GCTCGTCGTT
     7321 TGGTATGGCT TCATTCAGCT CCGGTTCCCA ACGATCAAGG CGAGTTACAT GATCCCCCAT
     7381 GTTGTGCAAA AAAGCGGTTA GCTCCTTCGG TCCTCCGATC GTTGTCAGAA GTAAGTTGGC
     7441 CGCAGTGTTA TCACTCATGG TTATGGCAGC ACTGCATAAT TCTCTTACTG TCATGCCATC
     7501 CGTAAGATGC TTTTCTGTGA CTGGTGAGTA CTCAACCAAG TCATTCTGAG AATAGTGTAT
     7561 GCGGCGACCG AGTTGCTCTT GCCCGGCGTC AATACGGGAT AATACCGCGC CACATAGCAG
     7621 AACTTTAAAA GTGCTCATCA TTGGAAAACG TTCTTCGGGG CGAAAACTCT CAAGGATCTT
     7681 ACCGCTGTTG AGATCCAGTT CGATGTAACC CACTCGTGCA CCCAACTGAT CTTCAGCATC
     7741 TTTTACTTTC ACCAGCGTTT CTGGGTGAGC AAAAACAGGA AGGCAAAATG CCGCAAAAAA
     7801 GGGAATAAGG GCGACACGGA AATGTTGAAT ACTCATACTC TTCCTTTTTC AATATTATTG
     7861 AAGCATTTAT CAGGGTTATT GTCTCATGAG CGGATACATA TTTGAATGTA TTTAGAAAAA
     7921 TAAACAAATA GGGGTTCCGC GCACATTTCC CCGAAAAGTG CCACCTGAC

// 



(SEQ ID NO: 6) 

[00350] pLL3.5

LOCUS       PLL3.5.GB    7927 BP DS-DNA   CIRCULAR  SYN       23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV promoter/enhancer 1"
     misc_recomb     4727..4760
                     /note="LoxP"
     promoter        2632..3841
                     /note="UbC promoter"
     gene            6934..7794
                     /note="AmpR"
     rep_origin      6116..6789
                     /note="pUC"
     misc_recomb     3931..3966
                     /note="LoxP"
     LTR             835..1509
                     /note="5' HIV R-U5-del gag (HIV NL4-3/454-1126)"
     misc_feature    1539..2396
                     /note="HIV RRE (HIV NL4-3/7622-8459)"
     misc_feature    2422..2599
                     /note="HIV Flap"
     misc_feature    4815..5404
                     /note="WRE element"
     LTR             5424..6113
                     /note="3' SIN LTR"
     gene            3993..4673
                     /note="dsRed2"
BASE COUNT     1958 A   1925 C   2151 G   1893 T      0 OTHER
ORIGIN

        1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA ATCTGCTCTG
       61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC GCTGAGTAGT
      121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC ATGAAGAATC
      181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT ACGCGTTGAC
      241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT CATAGCCCAT
      301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA CCGCCCAACG
      361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA ATAGGGACTT
      421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA GTACATCAAG
      481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG CCCGCCTGGC
      541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC TACGTATTAG
      601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT GGATAGCGGT
      661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT TTGTTTTGGC
      721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG ACGCAAATGG
      781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG TACTGGGTCT
      841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA CCCACTGCTT
      901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT GTTGTGTGAC
      961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC TAGCAGTGGC
     1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA CGCAGGACTC
     1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT ACGCCAAAAA
     1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT ATTAAGCGGG
     1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG AAAAAATATA
     1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT AATCCTGGCC
     1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA TCCCTTCAGA
     1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT TGTGTGCATC
     1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA GAGCAAAACA
     1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC TGGAGGAGGA
     1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA AATTGAACCA
     1621 TTAGGAGTAG CACCCACCAA GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA AAGAGCAGTG
     1681 GGAATAGGAG CTTTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT GGGCGCAGCG
     1741 TCAATGACGC TGACGGTACA GGCCAGACAA TTATTGTCTG GTATAGTGCA GCAGCAGAAC
     1801 AATTTGCTGA GGGCTATTGA GGCGCAACAG CATCTGTTGC AACTCACAGT CTGGGGCATC
     1861 AAGCAGCTCC AGGCAAGAAT CCTGGCTGTG GAAAGATACC TAAAGGATCA ACAGCTCCTG
     1921 GGGATTTGGG GTTGCTCTGG AAAACTCATT TGCACCACTG CTGTGCCTTG GAATGCTAGT
     1981 TGGAGTAATA AATCTCTGGA ACAGATTTGG AATCACACGA CCTGGATGGA GTGGGACAGA
     2041 GAAATTAACA ATTACACAAG CTTAATACAC TCCTTAATTG AAGAATCGCA AAACCAGCAA
     2101 GAAAAGAATG AACAAGAATT ATTGGAATTA GATAAATGGG CAAGTTTGTG GAATTGGTTT
     2161 AACATAACAA ATTGGCTGTG GTATATAAAA TTATTCATAA TGATAGTAGG AGGCTTGGTA
     2221 GGTTTAAGAA TAGTTTTTGC TGTACTTTCT ATAGTGAATA GAGTTAGGCA GGGATATTCA
     2281 CCATTATCGT TTCAGACCCA CCTCCCAACC CCGAGGGGAC CCGACAGGCC CGAAGGAATA
     2341 GAAGAAGAAG GTGGAGAGAG AGACAGAGAC AGATCCATTC GATTAGTGAA CGGATCGGCA
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24 01 CTGCGTGCGC CAATTCTGCA 
AAAGAAAAGG 

2461 GGGGATTGGG GGGTACAGTG 
CAGACATACA 
5 2521 AACTAAAGAA TTACAAAAAC 

ATTACAGGGA 

2581 CAGCAGAGAT CCAGTTTGGT 
TGGCCTCCGC 

2 641 GCCGGGTTTT GGCGCCTCCC 
10 GCCACGTCAG 

2701 ACGAAGGGCG CAGGAGCGTC 
CGGCCCGCTG 

2 761 CTCATAAGAC TCGGCCTTAG 
GACGGGACTT 
15 2 821 GGGTGACTCT AGGGCACTGG 

AAAAGTAGTC 

2881 CCTTCTCGGC GATTCTGCGG 
GATTATATAA 

2 941 GGACGCGCCG GGTGTGGCAC 
20 CGCGGTTCTT 

3 001 GTTTGTGGAT CGCTGTGATC 
GGCCGGGGCT 

3061 TTCGTGGCCG CCGGGCCGCT 
CAAGGGCTGT 
25 3121 AGTCTGGGTC CGCGAGCAAG 

CAGCAAAATG 

3181 GCGGCTGTTC CCGAGTCTTG 
GGTCGTTGAA 

3241 ACAAGGTGGG GGGCATGGTG 
30 CGCTAATGCG 

3301 GGAAAGCTCT TATTCGGGTG 
GACGTGAAGT 

3361 TTGTCACTGA CTGGAGAACT 
TATGCGGTGC 
35 3421 CGTTGGGCAG TGCACCCGTA 

TGACGTCACC 

3481 CGTTCTGTTG GCTTATAATG 
GGTAGGCTTT 

3541 TCTCCGTCGC AGGACGCAGG 
40 GACAGGCGCC 

3601 GGACCTCTGG TGAGGGGAGG 
TTTTATGTAC 

3661 CTATCTTCTT AAGTAGCTGA 
TGGCGAGTGT 
45 3721 GTTTTGTGAA GTTTTTTAGG 

TATGTAATTT 

3781 TCAGTGTTAG ACTAGTAAAT 
TTTTGTTAGA 

3841 CGAAGCTAAC GCGCTAGCCG 
50 TAAGCTCGCT 

3901 TCACGAGATT CCAGCAGGTC 
TTATACGAAG 

3961 TTATATTAAG GGTTCCAAGC 
GAGAACGTCA 
55 4021 TCACCGAGTT CATGCGCTTC 

CACGAGTTCG 

4081 AGATCGAGGG CGAGGGCGAG 
AAGCTGAAGG 



GACAAATGGC AGTATTCATC CACAATTTTA 
CAGGGGAAAG AATAGTAGAC ATAATAGCAA 
AAATTACAAA AATTCAAAAT TTTCGGGTTT 
TAGTACCGGG CCCGCTCTAG ACGGTTGATC 
GCGGGCGCCC CCCTCCTCAC GGCGAGCGCT 
CTGATCCTTC CGCCCGGACG CTCAGGACAG 
AACCCCAGTA TCAGCAGAAG GACATTTTAG 
TTTTCTTTCC AGAGAGCGGA ACAGGCGAGG 
AGGGATCTCC GTGGGGCGGT GAACGCCGAT 
AGCTAGTTCC GTCGCAGCCG GGATTTGGGT 
GTCACTTGGT GAGTAGCGGG CTGCTGGGCT 
CGGTGGGACG GAAGCGTGTG GAGAGACCGC 
GTTGCCCTGA ACTGGGGGTT GGGGGGAGCG 
AATGGAAGAC GCTTGTGAGG CGGGCTGTGA 
GGCGGCAAGA ACCCAAGGTC TTGAGGCCTT 
AGATGGGCTG GGGCACCATC TGGGGACCCT 
CGGTTTGTCG TCTGTTGCGG GGGCGGCAGT 
CCTTTGGGAG CGCGCGCCCT CGTCGTGTCG 
CAGGGTGGGG CCACCTGCCG GTAGGTGTGC 
GTTCGGGCCT AGGGTAGGCT CTCCTGAATC 
GATAAGTGAG GCGTCAGTTT CTTTGGTCGG 
AGCTCCGGTT TTGAACTATG CGCTCGGGGT 
CACCTTTTGA AATGTAATCA TTTGGGTCAA 
TGTCCGCTAA ATTCTGGCCG TTTTTGGCTT 
TTAATTAAGC CTCGAGGTCG ACGGTATCGA 
GAGGGACCTA ATAACTTCGT ATAGCATACA 
TTAAGCGGCC GCGCCACCAT GGCCTCCTCC 
AAGGTGCGCA TGGAGGGCAC CGTGAACGGC 
GGCCGCCCCT ACGAGGGCCA CAACACCGTG 
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4141 TGACCAAGGG CGGCCCCCTG CCCTTCGCCT GGGACATCCT GTCCCCCCAG 
TTCCAGTACG 

4201 GCTCCAAGGT GTACGTGAAG CACCCCGCCG ACATCCCCGA CTACAAGAAG 
CTGTCCTTCC 

5 4261 CCGAGGGCTT CAAGTGGGAG CGCGTGATGA ACTTCGAGGA CGGCGGCGTG 

GCGACCGTGA 

4321 CCCAGGACTC CTCCCTGCAG GACGGCTGCT TCATCTACAA GGTGAAGTTC 
ATCGGCGTGA 

43 81 ACTTCCCCTC CGACGGCCCC GTGATGCAGA AGAAGACCAT GGGCTGGGAG 
10 GCCTCCACCG 

4441 AGCGCCTGTA CCCCCGCGAC GGCGTGCTGA AGGGCGAGAC CCACAAGGCC 
CTGAAGCTGA 

4501 AGGACGGCGG CCACTACCTG GTGGAGTTCA AGTCCATCTA CATGGCCAAG 
AAGCCCGTGC 

15 4561 AGCTGCCCGG CTACTACTAC GTGGACGCCA AGCTGGACAT CACCTCCCAC 

AACGAGGACT 

4621 ACACCATCGT GGAGCAGTAC GAGCGCACCG AGGGCCGCCA CCACCTGTTC 
CTGATGCATG 

4681 CCCCGGGATG GCGCGCCATG GATCCGCGAA TTCGTCGAGG GACCTAATAA 
20 CTTCGTATAG 

4741 CATACATTAT ACGAAGTTAT ACATGTTTAA GGGTTCCGGT TCCACTAGGT 
ACAATTCGAT 

4801 ATCAAGCTTA TCGATAATCA ACCTCTGGAT TACAAAATTT GTGAAAGATT 
GACTGGTATT 

25 4861 CTTAACTATG TTGCTCCTTT TACGCTATGT GGATACGCTG CTTTAATGCC 

TTTGTATCAT 

4921 GCTATTGCTT CCCGTATGGC TTTCATTTTC TCCTCCTTGT ATAAATCCTG 
GTTGCTGTCT 

4981 CTTTATGAGG AGTTGTGGCC CGTTGTCAGG CAACGTGGCG TGGTGTGCAC 
30 TGTGTTTGCT 

5041 GACGCAACCC CCACTGGTTG GGGCATTGCC ACCACCTGTC AGCTCCTTTC 
CGGGACTTTC 

5101 GCTTTCCCCC TCCCTATTGC CACGGCGGAA CTCATCGCCG CCTGCCTTGC 
CCGCTGCTGG 

35 5161 ACAGGGGCTC GGCTGTTGGG CACTGACAAT TCCGTGGTGT TGTCGGGGAA 

ATCATCGTCC 

5221 TTTCCTTGGC TGCTCGCCTG TGTTGCCACC TGGATTCTGC GCGGGACGTC 
CTTCTGCTAC — ■ ■■ 

5281 GTCCCTTCGG CCCTCAATCC AGCGGACCTT CCTTCCCGCG GCCTGCTGCC 
40 GGCTCTGCGG 

5341 CCTCTTCCGC GTCTTCGCCT TCGCCCTCAG ACGAGTCGGA TCTCCCTTTG 
GGCCGCCTCC 

5401 CCGCATCGAT ACCGTCGACC TCGATCGAGA CCTAGAAAAA CATGGAGCAA 
TCACAAGTAG 

45 5461 CAATACAGCA GCTACCAATG CTGATTGTGC CTGGCTAGAA GCACAAGAGG 

AGGAGGAGGT 

5521 GGGTTTTCCA GTCACACCTC AGGTACCTTT AAGACCAATG ACTTACAAGG 
CAGCTGTAGA 

558 1 TCTTAGCCAC TTTTTAAAAG AAAAGGGGGG ACTGGAAGGG CTAATTCACT 
50 CCCAACGAAG 

5641 ACAAGATATC CTTGATCTGT GGATCTACCA CACACAAGGC TACTTCCCTG 
ATTGGCAGAA 

5701 CTACACACCA GGGCCAGGGA TCAGATATCC ACTGACCTTT GGATGGTGCT 
ACAAGCTAGT 

55 5761 ACCAGTTGAG CAAGAGAAGG TAGAAGAAGC CAATGAAGGA GAGAACACCC 

GCTTGTTACA 

5821 CCCTGTGAGC CTGCATGGGA TGGATGACCC GGAGAGAGAA GTATTAGAGT 
GGAGGTTTGA 
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5881 CAGCCGCCTA GCATTTCATC 
CTGGGTCTCT 

5941 CTGGTTAGAC CAGATCTGAG 
CACTGCTTAA 
5 6001 GCCTCAATAA AGCTTGCCTT 

TGTGTGACTC 

6061 TGGTAACTAG AGATCCCTCA 
GCAGCATGTG 

6121 AGCAAAAGGC CAGCAAAAGG 
10 CGTTTTTCCA 

6181 TAGGCTCCGC CCCCCTGACG 
GGTGGCGAAA 

6241 CCCGACAGGA CTATAAAGAT 
TGCGCTCTCC 
15 63 01 TGTTCCGACC CTGCCGCTTA 

GAAGCGTGGC 

6361 GCTTTCTCAT AGCTCACGCT 
GCTCCAAGCT 

6421 GGGCTGTGTG CACGAACCCC 
20 GTAACTATCG 

6481 TCTTGAGTCC AACCCGGTAA 
CTGGTAACAG 

6541 GATTAGCAGA GCGAGGTATG 
GGCCTAACTA 
25 6601 CGGCTACACT AGAAGAACAG 

TTACCTTCGG 

6661 AAAAAGAGTT GGTAGCTCTT 
GTGGTTTTTT 

6721 TGTTTGCAAG CAGCAGATTA 
30 CTTTGATCTT 

6781 TTCTACGGGG TCTGACGCTC 
TGGTCATGAG 

6841 ATTATCAAAA AGGATCTTCA 
TTAAATCAAT 
35 6901 CTAAAGTATA TATGAGTAAA 

GTGAGGCACC 

6961 TATCTCAGCG ATCTGTCTAT 
TCGTGTAGAT 

7021 AACTACGATA CGGGAGGGCT 
40 CGCGAGACCC 

7081 ACGCTCACCG GCTCCAGATT 
CCGAGCGCAG 

7141 AAGTGGTCCT GCAACTTTAT 
GGGAAGCTAG 
45 7201 AGTAAGTAGT TCGCCAGTTA 

CAGGCATCGT 

7261 GGTGTCACGC TCGTCGTTTG 
GATCAAGGCG 

7321 AGTTACATGA TCCCCCATGT 
50 CTCCGATCGT 

7381 TGTCAGAAGT AAGTTGGCCG 
TGCATAATTC 

7441 TCTTACTGTC ATGCCATCCG 
CAACCAAGTC 
55 7501 ATTCTGAGAA TAGTGTATGC 

TACGGGATAA 

7561 TACCGCGCCA CATAGCAGAA 
CTTCGGGGCG 



ACATGGCCCG AGAGCTGCAT CCGGACTGTA 
CCTGGGAGCT CTCTGGCTAA CTAGGGAACC 
GAGTGCTTCA AGTAGTGTGT GCCCGTCTGT 
GACCCTTTTA GTCAGTGTGG AAAATCTCTA 
CCAGGAACCG TAAAAAGGCC GCGTTGCTGG 
AGCATCACAA AAATCGACGC TCAAGTCAGA 
ACCAGGCGTT TCCCCCTGGA AGCTCCCTCG 
CCGGATACCT GTCCGCCTTT CTCCCTTCGG 
GTAGGTATCT CAGTTCGGTG TAGGTCGTTC 
CCGTTCAGCC CGACCGCTGC GCCTTATCCG 
GACACGACTT ATCGCCACTG GCAGCAGCCA 
TAGGCGGTGC TACAGAGTTC TTGAAGTGGT 
TATTTGGTAT CTGCGCTCTG CTGAAGCCAG 
GATCCGGCAA ACAAACCACC GCTGGTAGCG 
CGCGCAGAAA AAAAGGATCT CAAGAAGATC 
AGTGGAACGA AAACTCACGT TAAGGGATTT 
CCTAGATCCT TTTAAATTAA AAATGAAGTT 
CTTGGTCTGA CAGTTACCAA TGCTTAATCA 
TTCGTTCATC CATAGTTGCC TGACTCCCCG 
TACCATCTGG CCCCAGTGCT GCAATGATAC 
TATCAGCAAT AAACCAGCCA GCCGGAAGGG 
CCGCCTCCAT CCAGTCTATT AATTGTTGCC 
ATAGTTTGCG CAACGTTGTT GCCATTGCTA 
GTATGGCTTC ATTCAGCTCC GGTTCCCAAC 
TGTGCAAAAA AGCGGTTAGC TCCTTCGGTC 
CAGTGTTATC ACTCATGGTT ATGGCAGCAC 
TAAGATGCTT TTCTGTGACT GGTGAGTACT 
GGCGACCGAG TTGCTCTTGC CCGGCGTCAA 
CTTTAAAAGT GCTCATCATT GGAAAACGTT 
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7621 AAAACTCTCA AGGATCTTAC CGCTGTTGAG ATCCAGTTCG ATGTAACCCA 
CTCGTGCACC 

7681 CAACTGATCT TCAGCATCTT TTACTTTCAC CAGCGTTTCT GGGTGAGCAA 
AAACAGGAAG 

5 7741 GCAAAATGCC GCAAAAAAGG GAATAAGGGC GACACGGAAA TGTTGAATAC 

TCATACTCTT 

7801 CCTTTTTCAA TATTATTGAA GCATTTATCA GGGTTATTGT CTCATGAGCG 
GATACATATT 

7861 TGAATGTATT TAGAAAAATA AACAAATAGG GGTTCCGCGC ACATTTCCCC 
10 GAAAAGTGCC 

7921 ACCTGAC 

// 

(SEQ ID NO: 7) 

[00351] pLL3.6 

LOCUS       PLENTILOX    7350 BP DS-DNA   CIRCULAR  SYN   23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV promoter/enhancer 1"
     promoter        2799..3387
                     /note="CMV"
     gene            6357..7217
                     /note="AmpR"
     rep_origin      5539..6212
                     /note="pUC"
     misc_recomb     2710..2745
                     /note="Lox 1"
     misc_recomb     4150..4183
                     /note="LoxP"
     LTR             835..1509
                     /note="5' HIV R-U5-del gag (HIV NL4-3/454-1126)"
     misc_feature    1539..2396
                     /note="HIV RRE (HIV NL4-3/7622-8459)"
     misc_feature    2422..2599
                     /note="HIV Flap"
     misc_feature    4238..4827
                     /note="WRE element"
     LTR             4847..5536
                     /note="3' SIN LTR"
     frag            2772..4130
                     /note="1 to 1359 of Untitled1"
     frag            2772..2798
                     /note="4705 to 4731 of pEGFP-C1"
     frag            2799..4127
                     /note="1 to 1329 of pEGFP-C1"
     gene            3404..4127
                     /note="EGFP"
BASE COUNT     1939 A   1795 C   1862 G   1754 T      0 OTHER
ORIGIN
1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA ATCTGCTCTG
61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC GCTGAGTAGT
121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC ATGAAGAATC
181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT ACGCGTTGAC
241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT CATAGCCCAT
301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA CCGCCCAACG
361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA ATAGGGACTT
421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA GTACATCAAG
481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG CCCGCCTGGC
541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC TACGTATTAG
601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT GGATAGCGGT
661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT TTGTTTTGGC
721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG ACGCAAATGG
781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG TACTGGGTCT
841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA CCCACTGCTT
901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT GTTGTGTGAC
961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC TAGCAGTGGC
1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA CGCAGGACTC
1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT ACGCCAAAAA
1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT ATTAAGCGGG
1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG AAAAAATATA
1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT AATCCTGGCC
1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA TCCCTTCAGA
1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT TGTGTGCATC
1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA GAGCAAAACA
1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC TGGAGGAGGA
1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA AATTGAACCA
1621 TTAGGAGTAG CACCCACCAA GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA AAGAGCAGTG
1681 GGAATAGGAG CTTTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT GGGCGCAGCG
1741 TCAATGACGC TGACGGTACA GGCCAGACAA TTATTGTCTG GTATAGTGCA GCAGCAGAAC
1801 AATTTGCTGA GGGCTATTGA GGCGCAACAG CATCTGTTGC AACTCACAGT CTGGGGCATC
1861 AAGCAGCTCC AGGCAAGAAT CCTGGCTGTG GAAAGATACC TAAAGGATCA ACAGCTCCTG
1921 GGGATTTGGG GTTGCTCTGG AAAACTCATT TGCACCACTG CTGTGCCTTG GAATGCTAGT
1981 TGGAGTAATA AATCTCTGGA ACAGATTTGG AATCACACGA CCTGGATGGA GTGGGACAGA
2041 GAAATTAACA ATTACACAAG CTTAATACAC TCCTTAATTG AAGAATCGCA AAACCAGCAA
2101 GAAAAGAATG AACAAGAATT ATTGGAATTA GATAAATGGG CAAGTTTGTG GAATTGGTTT
2161 AACATAACAA ATTGGCTGTG GTATATAAAA TTATTCATAA TGATAGTAGG AGGCTTGGTA
2221 GGTTTAAGAA TAGTTTTTGC TGTACTTTCT ATAGTGAATA GAGTTAGGCA GGGATATTCA
2281 CCATTATCGT TTCAGACCCA CCTCCCAACC CCGAGGGGAC CCGACAGGCC CGAAGGAATA
2341 GAAGAAGAAG GTGGAGAGAG AGACAGAGAC AGATCCATTC GATTAGTGAA CGGATCGGCA
2401 CTGCGTGCGC CAATTCTGCA GACAAATGGC AGTATTCATC CACAATTTTA AAAGAAAAGG
2461 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA CAGACATACA
2521 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT ATTACAGGGA
2581 CAGCAGAGAT CCAGTTTGGT TAGTACCGGG CCCGCTCTAG ACGGTTAACG CGCTAGCCGT
2641 TAATTAAGCC TCGAGGTCGA CGGTATCGAT AAGCTCGCTT CACGAGATTC CAGCAGGTCG
2701 AGGGACCTAA TAACTTCGTA TAGCATACAT TATACGAAGT TATATTAAGG GTTCCAAGCT
2761 TAAGCGGCCG CGTGGATAAC CGTATTACCG CCATGCATTA GTTATTAATA GTAATCAATT
2821 ACGGGGTCAT TAGTTCATAG CCCATATATG GAGTTCCGCG TTACATAACT TACGGTAAAT
2881 GGCCCGCCTG GCTGACCGCC CAACGACCCC CGCCCATTGA CGTCAATAAT GACGTATGTT
2941 CCCATAGTAA CGCCAATAGG GACTTTCCAT TGACGTCAAT GGGTGGAGTA TTTACGGTAA
3001 ACTGCCCACT TGGCAGTACA TCAAGTGTAT CATATGCCAA GTACGCCCCC TATTGACGTC
3061 AATGACGGTA AATGGCCCGC CTGGCATTAT GCCCAGTACA TGACCTTATG GGACTTTCCT
3121 ACTTGGCAGT ACATCTACGT ATTAGTCATC GCTATTACCA TGGTGATGCG GTTTTGGCAG
3181 TACATCAATG GGCGTGGATA GCGGTTTGAC TCACGGGGAT TTCCAAGTCT CCACCCCATT
3241 GACGTCAATG GGAGTTTGTT TTGGCACCAA AATCAACGGG ACTTTCCAAA ATGTCGTAAC
3301 AACTCCGCCC CATTGACGCA AATGGGCGGT AGGCGTGTAC GGTGGGAGGT CTATATAAGC
3361 AGAGCTGGTT TAGTGAACCG TCAGATCCGC TAGCGCTACC GGTCGCCACC ATGGTGAGCA
3421 AGGGCGAGGA GCTGTTCACC GGGGTGGTGC CCATCCTGGT CGAGCTGGAC GGCGACGTAA
3481 ACGGCCACAA GTTCAGCGTG TCCGGCGAGG GCGAGGGCGA TGCCACCTAC GGCAAGCTGA
3541 CCCTGAAGTT CATCTGCACC ACCGGCAAGC TGCCCGTGCC CTGGCCCACC CTCGTGACCA
3601 CCCTGACCTA CGGCGTGCAG TGCTTCAGCC GCTACCCCGA CCACATGAAG CAGCACGACT
3661 TCTTCAAGTC CGCCATGCCC GAAGGCTACG TCCAGGAGCG CACCATCTTC TTCAAGGACG
3721 ACGGCAACTA CAAGACCCGC GCCGAGGTGA AGTTCGAGGG CGACACCCTG GTGAACCGCA
3781 TCGAGCTGAA GGGCATCGAC TTCAAGGAGG ACGGCAACAT CCTGGGGCAC AAGCTGGAGT
3841 ACAACTACAA CAGCCACAAC GTCTATATCA TGGCCGACAA GCAGAAGAAC GGCATCAAGG
3901 TGAACTTCAA GATCCGCCAC AACATCGAGG ACGGCAGCGT GCAGCTCGCC GACCACTACC
3961 AGCAGAACAC CCCCATCGGC GACGGCCCCG TGCTGCTGCC CGACAACCAC TACCTGAGCA
4021 CCCAGTCCGC CCTGAGCAAA GACCCCAACG AGAAGCGCGA TCACATGGTC CTGCTGGAGT
4081 TCGTGACCGC CGCCGGGATC ACTCTCGGCA TGGACGAGCT GTACAAGTAG GAATTCGTCG
4141 AGGGACCTAA TAACTTCGTA TAGCATACAT TATACGAAGT TATACATGTT TAAGGGTTCC
4201 GGTTCCACTA GGTACAATTC GATATCAAGC TTATCGATAA TCAACCTCTG GATTACAAAA
4261 TTTGTGAAAG ATTGACTGGT ATTCTTAACT ATGTTGCTCC TTTTACGCTA TGTGGATACG
4321 CTGCTTTAAT GCCTTTGTAT CATGCTATTG CTTCCCGTAT GGCTTTCATT TTCTCCTCCT
4381 TGTATAAATC CTGGTTGCTG TCTCTTTATG AGGAGTTGTG GCCCGTTGTC AGGCAACGTG
4441 GCGTGGTGTG CACTGTGTTT GCTGACGCAA CCCCCACTGG TTGGGGCATT GCCACCACCT
4501 GTCAGCTCCT TTCCGGGACT TTCGCTTTCC CCCTCCCTAT TGCCACGGCG GAACTCATCG
4561 CCGCCTGCCT TGCCCGCTGC TGGACAGGGG CTCGGCTGTT GGGCACTGAC AATTCCGTGG
4621 TGTTGTCGGG GAAATCATCG TCCTTTCCTT GGCTGCTCGC CTGTGTTGCC ACCTGGATTC
4681 TGCGCGGGAC GTCCTTCTGC TACGTCCCTT CGGCCCTCAA TCCAGCGGAC CTTCCTTCCC
4741 GCGGCCTGCT GCCGGCTCTG CGGCCTCTTC CGCGTCTTCG CCTTCGCCCT CAGACGAGTC
4801 GGATCTCCCT TTGGGCCGCC TCCCCGCATC GATACCGTCG ACCTCGATCG AGACCTAGAA
4861 AAACATGGAG CAATCACAAG TAGCAATACA GCAGCTACCA ATGCTGATTG TGCCTGGCTA
4921 GAAGCACAAG AGGAGGAGGA GGTGGGTTTT CCAGTCACAC CTCAGGTACC TTTAAGACCA
4981 ATGACTTACA AGGCAGCTGT AGATCTTAGC CACTTTTTAA AAGAAAAGGG GGGACTGGAA
5041 GGGCTAATTC ACTCCCAACG AAGACAAGAT ATCCTTGATC TGTGGATCTA CCACACACAA
5101 GGCTACTTCC CTGATTGGCA GAACTACACA CCAGGGCCAG GGATCAGATA TCCACTGACC
5161 TTTGGATGGT GCTACAAGCT AGTACCAGTT GAGCAAGAGA AGGTAGAAGA AGCCAATGAA
5221 GGAGAGAACA CCCGCTTGTT ACACCCTGTG AGCCTGCATG GGATGGATGA CCCGGAGAGA
5281 GAAGTATTAG AGTGGAGGTT TGACAGCCGC CTAGCATTTC ATCACATGGC CCGAGAGCTG
5341 CATCCGGACT GTACTGGGTC TCTCTGGTTA GACCAGATCT GAGCCTGGGA GCTCTCTGGC
5401 TAACTAGGGA ACCCACTGCT TAAGCCTCAA TAAAGCTTGC CTTGAGTGCT TCAAGTAGTG
5461 TGTGCCCGTC TGTTGTGTGA CTCTGGTAAC TAGAGATCCC TCAGACCCTT TTAGTCAGTG
5521 TGGAAAATCT CTAGCAGCAT GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA CCGTAAAAAG
5581 GCCGCGTTGC TGGCGTTTTT CCATAGGCTC CGCCCCCCTG ACGAGCATCA CAAAAATCGA
5641 CGCTCAAGTC AGAGGTGGCG AAACCCGACA GGACTATAAA GATACCAGGC GTTTCCCCCT






WO 2004/022722 



PCT/US2003/028111 



5701 GGAAGCTCCC TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA CCTGTCCGCC
5761 TTTCTCCCTT CGGGAAGCGT GGCGCTTTCT CATAGCTCAC GCTGTAGGTA TCTCAGTTCG
5821 GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA GCCCGACCGC
5881 TGCGCCTTAT CCGGTAACTA TCGTCTTGAG TCCAACCCGG TAAGACACGA CTTATCGCCA
5941 CTGGCAGCAG CCACTGGTAA CAGGATTAGC AGAGCGAGGT ATGTAGGCGG TGCTACAGAG
6001 TTCTTGAAGT GGTGGCCTAA CTACGGCTAC ACTAGAAGAA CAGTATTTGG TATCTGCGCT
6061 CTGCTGAAGC CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG CAAACAAACC
6121 ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG AAAAAAAGGA
6181 TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG CTCAGTGGAA CGAAAACTCA
6241 CGTTAAGGGA TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT CCTTTTAAAT
6301 TAAAAATGAA GTTTTAAATC AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC
6361 CAATGCTTAA TCAGTGAGGC ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT
6421 GCCTGACTCC CCGTCGTGTA GATAACTACG ATACGGGAGG GCTTACCATC TGGCCCCAGT
6481 GCTGCAATGA TACCGCGAGA CCCACGCTCA CCGGCTCCAG ATTTATCAGC AATAAACCAG
6541 CCAGCCGGAA GGGCCGAGCG CAGAAGTGGT CCTGCAACTT TATCCGCCTC CATCCAGTCT
6601 ATTAATTGTT GCCGGGAAGC TAGAGTAAGT AGTTCGCCAG TTAATAGTTT GCGCAACGTT
6661 GTTGCCATTG CTACAGGCAT CGTGGTGTCA CGCTCGTCGT TTGGTATGGC TTCATTCAGC
6721 TCCGGTTCCC AACGATCAAG GCGAGTTACA TGATCCCCCA TGTTGTGCAA AAAAGCGGTT
6781 AGCTCCTTCG GTCCTCCGAT CGTTGTCAGA AGTAAGTTGG CCGCAGTGTT ATCACTCATG
6841 GTTATGGCAG CACTGCATAA TTCTCTTACT GTCATGCCAT CCGTAAGATG CTTTTCTGTG
6901 ACTGGTGAGT ACTCAACCAA GTCATTCTGA GAATAGTGTA TGCGGCGACC GAGTTGCTCT
6961 TGCCCGGCGT CAATACGGGA TAATACCGCG CCACATAGCA GAACTTTAAA AGTGCTCATC
7021 ATTGGAAAAC GTTCTTCGGG GCGAAAACTC TCAAGGATCT TACCGCTGTT GAGATCCAGT
7081 TCGATGTAAC CCACTCGTGC ACCCAACTGA TCTTCAGCAT CTTTTACTTT CACCAGCGTT
7141 TCTGGGTGAG CAAAAACAGG AAGGCAAAAT GCCGCAAAAA AGGGAATAAG GGCGACACGG
7201 AAATGTTGAA TACTCATACT CTTCCTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT
7261 TGTCTCATGA GCGGATACAT ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG
7321 CGCACATTTC CCCGAAAAGT GCCACCTGAC
//

(SEQ ID NO: 8) 
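
A listing of this kind can be sanity-checked by tallying its bases against the BASE COUNT line. The following is a minimal sketch in Python and is not part of the original disclosure. The function names `parse_origin` and `base_count` are hypothetical, and the parser assumes its input contains only ORIGIN rows (a leading coordinate followed by blocks of bases) and the `//` terminator.

```python
import re
from collections import Counter

# Declared composition taken from the BASE COUNT line of the pLL3.6 listing.
DECLARED_PLL36 = {"A": 1939, "C": 1795, "G": 1862, "T": 1754, "OTHER": 0}


def parse_origin(text):
    """Join GenBank-style ORIGIN rows into one uppercase sequence string.

    Assumes each non-empty line is either '//' or a row made of a leading
    coordinate followed by whitespace-separated blocks of bases.
    """
    parts = []
    for line in text.splitlines():
        line = line.strip()
        if not line or line == "//":
            continue
        without_coord = re.sub(r"^\d+", "", line)   # drop the row coordinate
        parts.append(re.sub(r"[^A-Za-z]", "", without_coord))  # keep letters only
    return "".join(parts).upper()


def base_count(seq):
    """Return counts of A, C, G, T and of any other symbol in seq."""
    counts = Counter(seq)
    acgt = {base: counts[base] for base in "ACGT"}
    acgt["OTHER"] = len(seq) - sum(acgt.values())
    return acgt


# Usage: the first two rows of the listing, 60 bases each.
rows = """
1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA ATCTGCTCTG
61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC GCTGAGTAGT
//
"""
sample = parse_origin(rows)
print(len(sample), base_count(sample))
print(sum(DECLARED_PLL36.values()))  # declared counts should total the locus length
```

Running `base_count` on the complete sequence and comparing the result with `DECLARED_PLL36` would flag transcription errors such as dropped or duplicated blocks.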
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[00352] pLL3.7 



LOCUS       PLL3.7.GB    7650 BP DS-DNA   CIRCULAR  SYN   23-JAN-2002
DEFINITION  -
ACCESSION
KEYWORDS
SOURCE
FEATURES             Location/Qualifiers
     promoter        212..816
                     /note="CMV promoter/enhancer 1"
     misc_recomb     4450..4483
                     /note="Lox 2"
     promoter        3099..3687
                     /note="CMV"
     gene            6657..7517
                     /note="AmpR"
     rep_origin      5839..6512
                     /note="pUC"
     misc_recomb     3010..3045
                     /note="Lox 1"
     LTR             835..1509
                     /note="5' HIV R-U5-del gag (HIV NL4-3/454-1126)"
     misc_feature    1539..2396
                     /note="HIV RRE (HIV NL4-3/7622-8459)"
     misc_feature    2422..2599
                     /note="HIV Flap"
     misc_feature    4538..5127
                     /note="WRE element"
     LTR             5147..5836
                     /note="3' SIN LTR"
     frag            3072..4430
                     /note="1 to 1359 of Untitled1"
     frag            3072..3098
                     /note="4705 to 4731 of pEGFP-C1"
     frag            3099..4427
                     /note="1 to 1329 of pEGFP-C1"
     gene            3704..4427
                     /note="EGFP"
     frag            2617..2950
                     /note="1 to 334 of Untitled2"
     frag            2622..2935
                     /note="1 to 314 of mouseu6"
     source          2622..>2935
                     /organism="Mus musculus"
                     /db_xref="taxon:10090"
                     /clone="pmU6-52BE [Split]"
     promoter        2622..>2935
                     /note="U6 Promoter [Split]"
     misc_feature    2648..2658
                     /note="pot. SP1 binding site"
     misc_feature    2692..2701
                     /note="pot. SP1 binding site"
     misc_feature    2707..2714
                     /note="pot. enhancer"
     promoter        2869..2888
                     /note="pot. promoter region; sequence homologous
                     to PSE or element 'B'"
     promoter        2906..2911
                     /note="put. TATA-box"
BASE COUNT     2032 A   1861 C   1917 G   1840 T      0 OTHER
ORIGIN
1 GTCGACGGAT CGGGAGATCT CCCGATCCCC TATGGTGCAC TCTCAGTACA ATCTGCTCTG
61 ATGCCGCATA GTTAAGCCAG TATCTGCTCC CTGCTTGTGT GTTGGAGGTC GCTGAGTAGT
121 GCGCGAGCAA AATTTAAGCT ACAACAAGGC AAGGCTTGAC CGACAATTGC ATGAAGAATC
181 TGCTTAGGGT TAGGCGTTTT GCGCTGCTTC GCGATGTACG GGCCAGATAT ACGCGTTGAC
241 ATTGATTATT GACTAGTTAT TAATAGTAAT CAATTACGGG GTCATTAGTT CATAGCCCAT
301 ATATGGAGTT CCGCGTTACA TAACTTACGG TAAATGGCCC GCCTGGCTGA CCGCCCAACG
361 ACCCCCGCCC ATTGACGTCA ATAATGACGT ATGTTCCCAT AGTAACGCCA ATAGGGACTT
421 TCCATTGACG TCAATGGGTG GAGTATTTAC GGTAAACTGC CCACTTGGCA GTACATCAAG
481 TGTATCATAT GCCAAGTACG CCCCCTATTG ACGTCAATGA CGGTAAATGG CCCGCCTGGC
541 ATTATGCCCA GTACATGACC TTATGGGACT TTCCTACTTG GCAGTACATC TACGTATTAG
601 TCATCGCTAT TACCATGGTG ATGCGGTTTT GGCAGTACAT CAATGGGCGT GGATAGCGGT
661 TTGACTCACG GGGATTTCCA AGTCTCCACC CCATTGACGT CAATGGGAGT TTGTTTTGGC
721 ACCAAAATCA ACGGGACTTT CCAAAATGTC GTAACAACTC CGCCCCATTG ACGCAAATGG
781 GCGGTAGGCG TGTACGGTGG GAGGTCTATA TAAGCAGCGC GTTTTGCCTG TACTGGGTCT
841 CTCTGGTTAG ACCAGATCTG AGCCTGGGAG CTCTCTGGCT AACTAGGGAA CCCACTGCTT
901 AAGCCTCAAT AAAGCTTGCC TTGAGTGCTT CAAGTAGTGT GTGCCCGTCT GTTGTGTGAC
961 TCTGGTAACT AGAGATCCCT CAGACCCTTT TAGTCAGTGT GGAAAATCTC TAGCAGTGGC
1021 GCCCGAACAG GGACTTGAAA GCGAAAGGGA AACCAGAGGA GCTCTCTCGA CGCAGGACTC
1081 GGCTTGCTGA AGCGCGCACG GCAAGAGGCG AGGGGCGGCG ACTGGTGAGT ACGCCAAAAA
1141 TTTTGACTAG CGGAGGCTAG AAGGAGAGAG ATGGGTGCGA GAGCGTCAGT ATTAAGCGGG
1201 GGAGAATTAG ATCGCGATGG GAAAAAATTC GGTTAAGGCC AGGGGGAAAG AAAAAATATA
1261 AATTAAAACA TATAGTATGG GCAAGCAGGG AGCTAGAACG ATTCGCAGTT AATCCTGGCC
1321 TGTTAGAAAC ATCAGAAGGC TGTAGACAAA TACTGGGACA GCTACAACCA TCCCTTCAGA
1381 CAGGATCAGA AGAACTTAGA TCATTATATA ATACAGTAGC AACCCTCTAT TGTGTGCATC
1441 AAAGGATAGA GATAAAAGAC ACCAAGGAAG CTTTAGACAA GATAGAGGAA GAGCAAAACA
1501 AAAGTAAGAC CACCGCACAG CAAGCGGCCG GCCGCGCTGA TCTTCAGACC TGGAGGAGGA
1561 GATATGAGGG ACAATTGGAG AAGTGAATTA TATAAATATA AAGTAGTAAA AATTGAACCA
1621 TTAGGAGTAG CACCCACCAA GGCAAAGAGA AGAGTGGTGC AGAGAGAAAA AAGAGCAGTG
1681 GGAATAGGAG CTTTGTTCCT TGGGTTCTTG GGAGCAGCAG GAAGCACTAT GGGCGCAGCG
1741 TCAATGACGC TGACGGTACA GGCCAGACAA TTATTGTCTG GTATAGTGCA GCAGCAGAAC
1801 AATTTGCTGA GGGCTATTGA GGCGCAACAG CATCTGTTGC AACTCACAGT CTGGGGCATC
1861 AAGCAGCTCC AGGCAAGAAT CCTGGCTGTG GAAAGATACC TAAAGGATCA ACAGCTCCTG
1921 GGGATTTGGG GTTGCTCTGG AAAACTCATT TGCACCACTG CTGTGCCTTG GAATGCTAGT
1981 TGGAGTAATA AATCTCTGGA ACAGATTTGG AATCACACGA CCTGGATGGA GTGGGACAGA
2041 GAAATTAACA ATTACACAAG CTTAATACAC TCCTTAATTG AAGAATCGCA AAACCAGCAA
2101 GAAAAGAATG AACAAGAATT ATTGGAATTA GATAAATGGG CAAGTTTGTG GAATTGGTTT
2161 AACATAACAA ATTGGCTGTG GTATATAAAA TTATTCATAA TGATAGTAGG AGGCTTGGTA
2221 GGTTTAAGAA TAGTTTTTGC TGTACTTTCT ATAGTGAATA GAGTTAGGCA GGGATATTCA
2281 CCATTATCGT TTCAGACCCA CCTCCCAACC CCGAGGGGAC CCGACAGGCC CGAAGGAATA
2341 GAAGAAGAAG GTGGAGAGAG AGACAGAGAC AGATCCATTC GATTAGTGAA CGGATCGGCA
2401 CTGCGTGCGC CAATTCTGCA GACAAATGGC AGTATTCATC CACAATTTTA AAAGAAAAGG
2461 GGGGATTGGG GGGTACAGTG CAGGGGAAAG AATAGTAGAC ATAATAGCAA CAGACATACA
2521 AACTAAAGAA TTACAAAAAC AAATTACAAA AATTCAAAAT TTTCGGGTTT ATTACAGGGA
2581 CAGCAGAGAT CCAGTTTGGT TAGTACCGGG CCCGCTCTAG AGATCCGACG CCGCCATCTC
2641 TAGGCCCGCG CCGGCCCCCT CGCACAGACT TGTGGGAGAA GCTCGGCTAC TCCCCTGCCC
2701 CGGTTAATTT GCATATAATA TTTCCTAGTA ACTATAGAGG CTTAATGTGC GATAAAAGAC
2761 AGATAATCTG TTCTTTTTAA TACTAGCTAC ATTTTACATG ATAGGCTTGG ATTTCTATAA
2821 GAGATACAAA TACTAAATTA TTATTTTAAA AAACAGCACA AAAGGAAACT CACCCTAACT
2881 GTAAAGTAAT TGTGTGTTTT GAGACTATAA ATATCCCTTG GAGAAAAGCC TTGTTAACGC
2941 GCGGTGACCC TCGAGGTCGA CGGTATCGAT AAGCTCGCTT CACGAGATTC CAGCAGGTCG
3001 AGGGACCTAA TAACTTCGTA TAGCATACAT TATACGAAGT TATATTAAGG GTTCCAAGCT
3061 TAAGCGGCCG CGTGGATAAC CGTATTACCG CCATGCATTA GTTATTAATA GTAATCAATT
3121 ACGGGGTCAT TAGTTCATAG CCCATATATG GAGTTCCGCG TTACATAACT TACGGTAAAT
3181 GGCCCGCCTG GCTGACCGCC CAACGACCCC CGCCCATTGA CGTCAATAAT GACGTATGTT
3241 CCCATAGTAA CGCCAATAGG GACTTTCCAT TGACGTCAAT GGGTGGAGTA TTTACGGTAA
3301 ACTGCCCACT TGGCAGTACA TCAAGTGTAT CATATGCCAA GTACGCCCCC TATTGACGTC
3361 AATGACGGTA AATGGCCCGC CTGGCATTAT GCCCAGTACA TGACCTTATG GGACTTTCCT
3421 ACTTGGCAGT ACATCTACGT ATTAGTCATC GCTATTACCA TGGTGATGCG GTTTTGGCAG
3481 TACATCAATG GGCGTGGATA GCGGTTTGAC TCACGGGGAT TTCCAAGTCT CCACCCCATT
3541 GACGTCAATG GGAGTTTGTT TTGGCACCAA AATCAACGGG ACTTTCCAAA ATGTCGTAAC
3601 AACTCCGCCC CATTGACGCA AATGGGCGGT AGGCGTGTAC GGTGGGAGGT CTATATAAGC
3661 AGAGCTGGTT TAGTGAACCG TCAGATCCGC TAGCGCTACC GGTCGCCACC ATGGTGAGCA
3721 AGGGCGAGGA GCTGTTCACC GGGGTGGTGC CCATCCTGGT CGAGCTGGAC GGCGACGTAA
3781 ACGGCCACAA GTTCAGCGTG TCCGGCGAGG GCGAGGGCGA TGCCACCTAC GGCAAGCTGA
3841 CCCTGAAGTT CATCTGCACC ACCGGCAAGC TGCCCGTGCC CTGGCCCACC CTCGTGACCA
3901 CCCTGACCTA CGGCGTGCAG TGCTTCAGCC GCTACCCCGA CCACATGAAG CAGCACGACT
3961 TCTTCAAGTC CGCCATGCCC GAAGGCTACG TCCAGGAGCG CACCATCTTC TTCAAGGACG
4021 ACGGCAACTA CAAGACCCGC GCCGAGGTGA AGTTCGAGGG CGACACCCTG GTGAACCGCA
4081 TCGAGCTGAA GGGCATCGAC TTCAAGGAGG ACGGCAACAT CCTGGGGCAC AAGCTGGAGT
4141 ACAACTACAA CAGCCACAAC GTCTATATCA TGGCCGACAA GCAGAAGAAC GGCATCAAGG
4201 TGAACTTCAA GATCCGCCAC AACATCGAGG ACGGCAGCGT GCAGCTCGCC GACCACTACC
4261 AGCAGAACAC CCCCATCGGC GACGGCCCCG TGCTGCTGCC CGACAACCAC TACCTGAGCA
4321 CCCAGTCCGC CCTGAGCAAA GACCCCAACG AGAAGCGCGA TCACATGGTC CTGCTGGAGT
4381 TCGTGACCGC CGCCGGGATC ACTCTCGGCA TGGACGAGCT GTACAAGTAG GAATTCGTCG
4441 AGGGACCTAA TAACTTCGTA TAGCATACAT TATACGAAGT TATACATGTT TAAGGGTTCC
4501 GGTTCCACTA GGTACAATTC GATATCAAGC TTATCGATAA TCAACCTCTG GATTACAAAA
4561 TTTGTGAAAG ATTGACTGGT ATTCTTAACT ATGTTGCTCC TTTTACGCTA TGTGGATACG
4621 CTGCTTTAAT GCCTTTGTAT CATGCTATTG CTTCCCGTAT GGCTTTCATT TTCTCCTCCT
4681 TGTATAAATC CTGGTTGCTG TCTCTTTATG AGGAGTTGTG GCCCGTTGTC AGGCAACGTG
4741 GCGTGGTGTG CACTGTGTTT GCTGACGCAA CCCCCACTGG TTGGGGCATT GCCACCACCT
4801 GTCAGCTCCT TTCCGGGACT TTCGCTTTCC CCCTCCCTAT TGCCACGGCG GAACTCATCG
4861 CCGCCTGCCT TGCCCGCTGC TGGACAGGGG CTCGGCTGTT GGGCACTGAC AATTCCGTGG
4921 TGTTGTCGGG GAAATCATCG TCCTTTCCTT GGCTGCTCGC CTGTGTTGCC ACCTGGATTC
4981 TGCGCGGGAC GTCCTTCTGC TACGTCCCTT CGGCCCTCAA TCCAGCGGAC CTTCCTTCCC
5041 GCGGCCTGCT GCCGGCTCTG CGGCCTCTTC CGCGTCTTCG CCTTCGCCCT CAGACGAGTC
5101 GGATCTCCCT TTGGGCCGCC TCCCCGCATC GATACCGTCG ACCTCGATCG AGACCTAGAA
5161 AAACATGGAG CAATCACAAG TAGCAATACA GCAGCTACCA ATGCTGATTG TGCCTGGCTA
5221 GAAGCACAAG AGGAGGAGGA GGTGGGTTTT CCAGTCACAC CTCAGGTACC TTTAAGACCA
5281 ATGACTTACA AGGCAGCTGT AGATCTTAGC CACTTTTTAA AAGAAAAGGG GGGACTGGAA
5341 GGGCTAATTC ACTCCCAACG AAGACAAGAT ATCCTTGATC TGTGGATCTA CCACACACAA
5401 GGCTACTTCC CTGATTGGCA GAACTACACA CCAGGGCCAG GGATCAGATA TCCACTGACC
5461 TTTGGATGGT GCTACAAGCT AGTACCAGTT GAGCAAGAGA AGGTAGAAGA AGCCAATGAA
5521 GGAGAGAACA CCCGCTTGTT ACACCCTGTG AGCCTGCATG GGATGGATGA CCCGGAGAGA
5581 GAAGTATTAG AGTGGAGGTT TGACAGCCGC CTAGCATTTC ATCACATGGC CCGAGAGCTG
5641 CATCCGGACT GTACTGGGTC TCTCTGGTTA GACCAGATCT GAGCCTGGGA GCTCTCTGGC
5701 TAACTAGGGA ACCCACTGCT TAAGCCTCAA TAAAGCTTGC CTTGAGTGCT TCAAGTAGTG
5761 TGTGCCCGTC TGTTGTGTGA CTCTGGTAAC TAGAGATCCC TCAGACCCTT TTAGTCAGTG
5821 TGGAAAATCT CTAGCAGCAT GTGAGCAAAA GGCCAGCAAA AGGCCAGGAA CCGTAAAAAG
5881 GCCGCGTTGC TGGCGTTTTT CCATAGGCTC CGCCCCCCTG ACGAGCATCA CAAAAATCGA
5941 CGCTCAAGTC AGAGGTGGCG AAACCCGACA GGACTATAAA GATACCAGGC GTTTCCCCCT
6001 GGAAGCTCCC TCGTGCGCTC TCCTGTTCCG ACCCTGCCGC TTACCGGATA CCTGTCCGCC
6061 TTTCTCCCTT CGGGAAGCGT GGCGCTTTCT CATAGCTCAC GCTGTAGGTA TCTCAGTTCG
6121 GTGTAGGTCG TTCGCTCCAA GCTGGGCTGT GTGCACGAAC CCCCCGTTCA GCCCGACCGC
6181 TGCGCCTTAT CCGGTAACTA TCGTCTTGAG TCCAACCCGG TAAGACACGA CTTATCGCCA
6241 CTGGCAGCAG CCACTGGTAA CAGGATTAGC AGAGCGAGGT ATGTAGGCGG TGCTACAGAG
6301 TTCTTGAAGT GGTGGCCTAA CTACGGCTAC ACTAGAAGAA CAGTATTTGG TATCTGCGCT
6361 CTGCTGAAGC CAGTTACCTT CGGAAAAAGA GTTGGTAGCT CTTGATCCGG CAAACAAACC
6421 ACCGCTGGTA GCGGTGGTTT TTTTGTTTGC AAGCAGCAGA TTACGCGCAG AAAAAAAGGA
6481 TCTCAAGAAG ATCCTTTGAT CTTTTCTACG GGGTCTGACG CTCAGTGGAA CGAAAACTCA
6541 CGTTAAGGGA TTTTGGTCAT GAGATTATCA AAAAGGATCT TCACCTAGAT CCTTTTAAAT
6601 TAAAAATGAA GTTTTAAATC AATCTAAAGT ATATATGAGT AAACTTGGTC TGACAGTTAC
6661 CAATGCTTAA TCAGTGAGGC ACCTATCTCA GCGATCTGTC TATTTCGTTC ATCCATAGTT
6721 GCCTGACTCC CCGTCGTGTA GATAACTACG ATACGGGAGG GCTTACCATC TGGCCCCAGT
6781 GCTGCAATGA TACCGCGAGA CCCACGCTCA CCGGCTCCAG ATTTATCAGC AATAAACCAG
6841 CCAGCCGGAA GGGCCGAGCG CAGAAGTGGT CCTGCAACTT TATCCGCCTC CATCCAGTCT
6901 ATTAATTGTT GCCGGGAAGC TAGAGTAAGT AGTTCGCCAG TTAATAGTTT GCGCAACGTT
6961 GTTGCCATTG CTACAGGCAT CGTGGTGTCA CGCTCGTCGT TTGGTATGGC TTCATTCAGC
7021 TCCGGTTCCC AACGATCAAG GCGAGTTACA TGATCCCCCA TGTTGTGCAA AAAAGCGGTT
7081 AGCTCCTTCG GTCCTCCGAT CGTTGTCAGA AGTAAGTTGG CCGCAGTGTT ATCACTCATG
7141 GTTATGGCAG CACTGCATAA TTCTCTTACT GTCATGCCAT CCGTAAGATG CTTTTCTGTG
7201 ACTGGTGAGT ACTCAACCAA GTCATTCTGA GAATAGTGTA TGCGGCGACC GAGTTGCTCT
7261 TGCCCGGCGT CAATACGGGA TAATACCGCG CCACATAGCA GAACTTTAAA AGTGCTCATC
7321 ATTGGAAAAC GTTCTTCGGG GCGAAAACTC TCAAGGATCT TACCGCTGTT GAGATCCAGT
7381 TCGATGTAAC CCACTCGTGC ACCCAACTGA TCTTCAGCAT CTTTTACTTT CACCAGCGTT
7441 TCTGGGTGAG CAAAAACAGG AAGGCAAAAT GCCGCAAAAA AGGGAATAAG GGCGACACGG
7501 AAATGTTGAA TACTCATACT CTTCCTTTTT CAATATTATT GAAGCATTTA TCAGGGTTAT
7561 TGTCTCATGA GCGGATACAT ATTTGAATGT ATTTAGAAAA ATAAACAAAT AGGGGTTCCG
7621 CGCACATTTC CCCGAAAAGT GCCACCTGAC
//

(SEQ ID NO: 9)
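
The vectors are characterized in part by unique restriction sites (for example NotI, XhoI, and BamHI). A hedged Python sketch of how uniqueness could be checked on a circular plasmid sequence follows. It is not part of the original disclosure. The names `SITES`, `count_sites`, and `unique_cutters` are hypothetical, and the recognition sequences are the standard published ones for these enzymes, not values taken from this document. Because every site listed is palindromic, scanning one strand is sufficient. The toy sequence is illustrative only and is not one of the vectors.

```python
# Standard recognition sequences (an assumption supplied for illustration;
# only a subset of enzymes, all with palindromic, non-degenerate sites).
SITES = {
    "NotI": "GCGGCCGC",
    "ApaI": "GGGCCC",
    "XhoI": "CTCGAG",
    "XbaI": "TCTAGA",
    "HpaI": "GTTAAC",
    "NheI": "GCTAGC",
    "PacI": "TTAATTAA",
    "BamHI": "GGATCC",
}


def count_sites(seq, site):
    """Count occurrences of site in a circular sequence, overlaps included."""
    seq = seq.upper()
    # Append the first len(site) - 1 bases so that sites spanning the
    # origin of the circular molecule are also found.
    extended = seq + seq[: len(site) - 1]
    count = 0
    start = extended.find(site)
    while start != -1:
        count += 1
        start = extended.find(site, start + 1)
    return count


def unique_cutters(seq, sites=SITES):
    """Return the sorted names of enzymes whose site occurs exactly once."""
    return sorted(name for name, site in sites.items()
                  if count_sites(seq, site) == 1)


# Toy circular sequence: one XhoI site, two BamHI sites, and one NotI
# site that spans the origin (ends in GC, begins with GGCCGC).
toy = "GGCCGC" + "AAAA" + "CTCGAG" + "TTTT" + "GGATCC" + "AA" + "GGATCC" + "GC"
print(unique_cutters(toy))
```

Treating the sequence as circular matters here: a linear scan of the toy sequence would miss the NotI site that straddles the origin.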
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Equivalents 

[00353] Those skilled in the art will recognize, or be able to ascertain using no
more than routine experimentation, many equivalents to the specific embodiments of
the invention described herein. The Examples below are provided to illustrate the
invention and are not limiting. Alternative procedures known to one of ordinary skill
in the art might also be used. The scope of the present invention is not intended to be
limited to the above Description, but rather is as set forth in the appended claims.

We claim: 


2 


1. 


A lentiviral vector comprising the following elements: a nucleic acid whose 


3 




sequence includes (i) a functional packaging signal; (ii) a multiple cloning site 


4 




(MCS); and (iii) at least one additional element selected from the group 


5 




consisting of: a second MCS, a second MCS into which a heterologous nucleic 


6 




acid is inserted, an HIV FLAP element, an expression-enhancing 


7 




posttranscriptional regulatory element, a target site for a site-specific 


8 




recombinase, and a self-inactivating (SIN) LTR, wherein the lentiviral vector 


9 




is a lentiviral transfer plasmid or an infectious lentiviral particle. 


10 


2. 


The lentiviral vector of claim 1, wherein the vector comprises at least two 


11 




elements selected from the group consisting of: a second MCS, a second MCS 


12 




into which a heterologous nucleic acid is inserted, an HIV FLAP element, an 


13 




expression-enhancing posttranscriptional regulatory element, a target site for a 


14 




site-specific recombinase, and a self-inactivating (SIN) LTR. 


15 


3. 


The lentiviral vector of claim 1, wherein the vector comprises at least three 


16 




elements selected from the group consisting of: a second MCS, a second MCS 


17 




into which a heterologous nucleic acid is inserted, an HIV FLAP element, an 


18 




expression-enhancing posttranscriptional regulatory element, a target site for a 


19 




site-specific recombinase, and a self-inactivating (SIN) LTR. 


20 


4. The lentiviral vector of claim 1, wherein the vector comprises at least four
elements selected from the group consisting of: a second MCS, a second MCS
into which a heterologous nucleic acid is inserted, an HIV FLAP element, an
expression-enhancing posttranscriptional regulatory element, a target site for a
site-specific recombinase, and a self-inactivating (SIN) LTR.


5. The lentiviral vector of claim 1, wherein the vector comprises a second MCS,
an HIV FLAP element, an expression-enhancing posttranscriptional regulatory
element, a target site for a site-specific recombinase, and a self-inactivating
(SIN) LTR.

6. The lentiviral vector of claim 1, wherein the vector comprises a second MCS
into which a heterologous nucleic acid is inserted, an HIV FLAP element, an
expression-enhancing posttranscriptional regulatory element, a target site for a
site-specific recombinase, and a self-inactivating (SIN) LTR.

7. The lentiviral vector of claim 1, wherein the additional element is a second
MCS.

8. The lentiviral vector of claim 1, wherein the additional element is a second
MCS into which a heterologous nucleic acid is inserted.

9. The lentiviral vector of claim 1, wherein the vector has unique restriction sites
for at least 4 enzymes selected from the group consisting of NotI, ApaI, XhoI,
XbaI, HpaI, NheI, PacI, NsiI, SphI, Sma/Xma, AccI, BamHI, and SphI.

10. The lentiviral vector of claim 1, wherein the vector has unique restriction sites
for at least 5, at least 6, at least 7, at least 8, at least 9, at least 10, at least 11, at
least 12, or at least 13 enzymes selected from the group consisting of NotI,
ApaI, XhoI, XbaI, HpaI, NheI, PacI, NsiI, SphI, Sma/Xma, AccI, BamHI, and
SphI.

11. The lentiviral vector of claim 1, wherein the additional element is an HIV
FLAP element.

12. The lentiviral vector of claim 1, wherein the additional element is an
expression-enhancing posttranscriptional regulatory element.

13. The lentiviral vector of claim 12, wherein the expression-enhancing
posttranscriptional regulatory element is a WRE.

14. The lentiviral vector of claim 1, wherein the additional element is a target site
for a site-specific recombinase.

15. The lentiviral vector of claim 14, wherein the site is a loxP site.

16. The lentiviral vector of claim 1, wherein the lentiviral vector is a lentiviral
transfer plasmid.

17. The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of
less than 10 kB.

18. The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of
less than 9 kB.

19. The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of
less than 8 kB.

20. The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of
less than 7 kB.

21. The lentiviral transfer plasmid of claim 16, wherein the plasmid has a size of
approximately 6 kB.

22. The lentiviral vector of claim 1, wherein the lentiviral vector is an infectious
lentiviral particle.

23. The lentiviral vector of claim 1, further comprising: a heterologous promoter
or promoter-enhancer.

24. The lentiviral vector of claim 23, wherein the heterologous promoter or
promoter-enhancer is selected from the group consisting of: the CMV
promoter, the CMV promoter-enhancer, and the ubiquitin C promoter.

25. The lentiviral vector of claim 24, wherein the heterologous promoter is an
inducible promoter.

26. The lentiviral vector of claim 24, wherein the heterologous promoter is a cell
type specific or tissue specific promoter.

27. The lentiviral vector of claim 23, wherein the heterologous promoter is an
RNA polymerase promoter.

28. The lentiviral vector of claim 27, wherein the RNA polymerase promoter is an
RNA polymerase III promoter.

29. The lentiviral vector of claim 28, wherein the RNA polymerase III promoter is
a U6 promoter.

30. The lentiviral vector of claim 28, wherein the RNA polymerase III promoter is
an H1 promoter.

31. The lentiviral vector of claim 27, wherein the RNA polymerase promoter is an
RNA polymerase II promoter.

32. The lentiviral vector of claim 23, further comprising a second heterologous
promoter or promoter-enhancer.

33. The lentiviral vector of claim 1, further comprising a heterologous nucleic acid
encoding a selectable marker operably linked to a promoter.

34. The lentiviral vector of claim 1, further comprising a heterologous nucleic acid
encoding a reporter molecule operably linked to a promoter.

35. The lentiviral vector of claim 34, wherein the reporter molecule is selected
from the group consisting of: GFP, EGFP, dsRed, dsRed2, cyan fluorescent
protein, yellow fluorescent protein, blue fluorescent protein, dsRed, dsRed2,
luciferase, and aequorin.

36. The lentiviral vector of claim 34, further comprising an RNA polymerase
promoter.

37. The lentiviral vector of claim 36, wherein the RNA polymerase promoter is an
RNA polymerase III promoter.

38. The lentiviral vector of claim 1, wherein the lentiviral vector is a transfer
plasmid, further comprising a genetic element sufficient for stable
maintenance of the transfer plasmid as an episome within mammalian cells.

39. A lentiviral vector comprising an RNA polymerase III promoter.

40. The lentiviral vector of claim 39, wherein the RNA polymerase III promoter is
a U6 promoter.

41. The lentiviral vector of claim 39, wherein the RNA polymerase III promoter is
an H1 promoter.

42. The lentiviral vector of claim 39, further comprising a heterologous nucleic
acid encoding a reporter molecule.

43. A lentiviral vector having a sequence as set forth in SEQ ID NO: 2, SEQ ID
NO: 3, SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ
ID NO: 8, or SEQ ID NO: 9.

44. A collection of at least two of the lentiviral vectors of claim 43.

45. The collection of claim 44, wherein the collection includes a vector
comprising a first heterologous promoter element and a vector comprising a
second heterologous promoter element different from the first promoter
element.

46. The collection of claim 44, wherein the collection includes a vector
comprising a first heterologous reporter gene and a vector comprising a
second reporter gene different from the first reporter gene.

47. A lentiviral vector having a sequence that differs by not more than 100
nucleotides from the sequence set forth in SEQ ID NO: 2, SEQ ID NO: 3,
SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO:
8, or SEQ ID NO: 9.

48. A collection of at least two of the lentiviral vectors of claim 47.

49. The collection of claim 47, wherein the collection includes a vector
comprising a first heterologous promoter element and a vector comprising a
second heterologous promoter element different from the first promoter
element.

50. The collection of claim 47, wherein the collection includes a vector
comprising a first heterologous reporter gene and a vector comprising a
second reporter gene different from the first reporter gene.

51. A lentiviral vector having a sequence that differs by not more than X
nucleotides from the sequence set forth in SEQ ID NO: 2, SEQ ID NO: 3,
SEQ ID NO: 4, SEQ ID NO: 5, SEQ ID NO: 6, SEQ ID NO: 7, SEQ ID NO:
8, or SEQ ID NO: 9, where X represents any number between 1 and 99,
inclusive.

52. A collection of at least two of the lentiviral vectors of claim 51.

53. A collection of at least two of the lentiviral vectors of claim 51.

54. The collection of claim 53, wherein the collection includes a vector
comprising a first heterologous promoter element and a vector comprising a
second heterologous promoter element different from the first promoter
element.

55. The collection of claim 53, wherein the collection includes a vector
comprising a first heterologous reporter gene and a vector comprising a
second reporter gene different from the first reporter gene.

56. A three-plasmid lentiviral expression system comprising:
(a) a first plasmid whose sequence comprises a nucleic acid sequence
of at least part of a lentiviral genome, wherein the plasmid (i) contains at least
one defect in at least one gene encoding a lentiviral structural protein, and (ii)
lacks a functional packaging signal;
(b) a second plasmid whose sequence comprises a nucleic acid
sequence of a virus, wherein the plasmid (i) expresses a viral envelope protein,
and (ii) lacks a functional packaging signal; and
(c) a third plasmid whose nucleic acid sequence includes (i) a
functional packaging signal; (ii) a multiple cloning site (MCS); and (iii) at
least one additional element selected from the group consisting of: a second
MCS, a second MCS into which a heterologous nucleic acid is inserted, an
HIV FLAP element, an expression-enhancing posttranscriptional regulatory
element, a target site for a site-specific recombinase, and a self-inactivating
(SIN) LTR.

57. A four plasmid lentiviral expression system comprising the three plasmid
lentiviral expression system of claim 56, further comprising a fourth plasmid
comprising a nucleic acid segment that encodes Rev, operably linked to a
promoter.

58. A cell comprising the lentiviral vector of claim 1.

59. The cell of claim 58, wherein the cell comprises a nucleic acid or nucleic acids
having sequences encoding Gag, Pol, and Env proteins.

60. A cell comprising a provirus derived from the lentiviral vector of claim 1.

61. A transgenic animal, at least some of whose cells contain the lentiviral vector
of claim 1.

62. A transgenic animal, at least some of whose cells contain a provirus derived
from the lentiviral vector of claim 1.

63. A method of creating a producer cell line comprising introducing the lentiviral
vector of claim 1 into a host cell, wherein the lentiviral vector is a transfer
plasmid; and introducing a packaging plasmid and an envelope plasmid into
the host cell.

64. A method of producing lentiviral particles comprising
(i) introducing the lentiviral vector of claim 1 into a helper cell,
wherein the lentiviral vector is a transfer plasmid comprising a genetic
element sufficient for stable maintenance of the plasmid as an episome in
mammalian cells, into a helper cell that expresses proteins required for
production of infectious lentiviral particles; and
(ii) culturing the cell for a period sufficient to allow production of
lentiviral particles.

65. A method of producing lentiviral particles comprising
(i) introducing the lentiviral vector of claim 1, which lentiviral vector
is a lentiviral transfer plasmid comprising a genetic element sufficient for
stable maintenance of the transfer plasmid as an episome in mammalian cells,
into a helper cell that expresses a protein required for production of lentiviral
particles, wherein expression of the protein is under control of an inducible
promoter;
(ii) inducing expression of the protein required for production of
lentiviral particles; and
(iii) culturing the cell for a period sufficient to allow production of
lentiviral particles.

66. A method of expressing a heterologous nucleic acid in a target cell comprising
introducing a lentiviral vector of claim 1 into the target cell, wherein
the lentiviral vector comprises a heterologous nucleic acid operably linked to a
promoter; and
expressing the heterologous nucleic acid therein.

67. A method for achieving controlled expression of a heterologous nucleic acid
in a cell comprising steps of:
(i) providing a modified lentiviral vector comprising a heterologous
nucleic acid inserted between sites for a recombinase;
(ii) introducing the modified lentiviral vector or a portion thereof
including at least the sites for the recombinase and the region between the sites
into the cell; and
(iii) subsequently inducing expression of the recombinase within the
cell, thereby preventing expression of the heterologous nucleic acid within the
cell.

68. The method of claim 67, wherein the providing step comprises inserting the
heterologous nucleic acid into a lentiviral vector between sites for a
recombinase, thereby producing a modified lentiviral vector.

69. The method of claim 67, wherein the cell is a mammalian cell.

70. A method for expressing a transcript in a mammal in a cell type or
tissue-specific manner comprising:
(i) delivering a lentiviral vector to cells of the mammal, wherein the
lentiviral vector comprises a heterologous nucleic acid operably linked to a
promoter so that transcription from the promoter results in synthesis of the
transcript, and wherein the heterologous nucleic acid is located between sites
for a site-specific recombinase; and
(ii) inducing expression of the site-specific recombinase in a subset of
the cells of the mammal, thereby preventing synthesis of the transcript within
those cells.

71. The method of claim 67 or claim 70, wherein the step of inducing the
site-specific recombinase comprises introducing a vector encoding the site-specific
recombinase into the cell.

72. The method of claim 67 or claim 70, wherein expression of the site-specific
recombinase is under control of a cell type specific or tissue specific promoter.

73. The method of claim 67 or claim 70, wherein the sites are loxP sites and the
site-specific recombinase is Cre.

74. A lentiviral vector whose presence within a cell results in transcription of one
or more ribonucleic acids (RNAs) that self-hybridize or hybridize to each
other to form a short hairpin RNA or short interfering RNA that inhibits
expression of at least one target transcript in the cell.

75. The lentiviral vector of claim 74, wherein the vector provides a template for
synthesis of an RNA that self-hybridizes to form an shRNA that is targeted to
the transcript.

76. The lentiviral vector of claim 75, wherein the shRNA comprises a loop having
a sequence set forth in SEQ ID NO: 10.

77. The lentiviral vector of claim 74, wherein the vector provides a template for
synthesis of complementary RNAs that hybridize with each other to form an
siRNA that is targeted to the transcript.

78. The lentiviral vector of claim 74, wherein the vector comprises a nucleic acid
segment operably linked to a promoter, so that transcription from the promoter
results in synthesis of one or more RNAs that self-hybridize or hybridize with
each other to form an shRNA or siRNA targeted to the transcript.

79. The composition of claim 74, wherein the lentiviral vector is a lentiviral
transfer plasmid.

80. The composition of claim 74, wherein the lentiviral vector is an infectious
lentiviral particle.

81. The lentiviral vector of claim 74, wherein:
the shRNA or siRNA comprises a base-paired region approximately 19
nucleotides long.

82. The lentiviral vector of claim 74, wherein:
the shRNA or siRNA comprises a base-paired region and at least one
single-stranded overhang.

83. The lentiviral vector of claim 74, wherein:
the siRNA or shRNA comprises a 3' overhang consisting of at least
two pyrimidines.

84. The lentiviral vector of claim 83, wherein the 3' overhang is UU.

85. The lentiviral vector of claim 74, wherein:
the shRNA or siRNA comprises a region that is precisely
complementary with a region of the target transcript.

86. The lentiviral vector of claim 74, wherein the siRNA or shRNA is present at a
level sufficient to reduce the level of the target transcript or its encoded
protein by at least about 2 fold.

87. The lentiviral vector of claim 74, wherein the siRNA or shRNA is present at a
level sufficient to reduce the level of the target transcript or its encoded
protein by at least about 5 fold.

88. The lentiviral vector of claim 74, wherein the siRNA or shRNA is present at a
level sufficient to reduce the level of the target transcript or its encoded
protein by at least about 10 fold.

89. The lentiviral vector of claim 74, wherein the siRNA or shRNA is present at a
level sufficient to reduce the level of the target transcript or its encoded
protein by at least about 25 fold.

90. The lentiviral vector of claim 74, wherein the lentiviral vector comprises:
(i) a functional packaging signal;
(ii) a multiple cloning site (MCS); and
(iii) at least one additional element selected from the group consisting
of: a second MCS, a second MCS into which a heterologous promoter or
promoter-enhancer is inserted, an HIV FLAP element, an
expression-enhancing posttranscriptional regulatory element, a target site for a
site-specific recombinase, and a self-inactivating (SIN) LTR.

91. A composition comprising:
the lentiviral vector of claim 74; and
a delivery agent that enhances delivery of the vector to cells.

92. A pharmaceutical composition comprising:
the lentiviral vector of claim 74; and
a pharmaceutically acceptable carrier.

93. A three plasmid lentiviral expression system comprising (i) a lentiviral transfer
plasmid comprising a heterologous nucleic acid operably linked to a promoter,
so that transcription of the heterologous nucleic acid produces one or more
RNAs that self-hybridize or hybridize with each other to form an shRNA or
siRNA targeted to a target transcript; (ii) a packaging plasmid; and (iii) an
Env-coding plasmid.

94. A four plasmid lentiviral expression system comprising the three plasmid
lentiviral expression system of claim 93, further comprising a fourth plasmid
comprising a nucleic acid segment that encodes Rev, operably linked to a
promoter.

95. A method of inhibiting or reducing the expression of a target transcript in a
cell comprising delivering the lentiviral vector of claim 74 to the cell.

96. The method of claim 95, wherein the lentiviral vector comprises:
(i) a functional packaging signal;
(ii) a multiple cloning site (MCS); and
(iii) at least one additional element selected from the group consisting
of: a second MCS, a second MCS into which a heterologous promoter or
promoter-enhancer is inserted, an HIV FLAP element, an
expression-enhancing posttranscriptional regulatory element, a target site for a
site-specific recombinase, and a self-inactivating (SIN) LTR.

97. The method of claim 95, wherein the cell is a mammalian cell.

98. The method of claim 95, wherein the cell is a primary cell.

99. The method of claim 95, wherein the primary cell is a T cell.

100. The method of claim 95, wherein the cell is a non-dividing cell.

101. The method of claim 95, wherein the cell is an embryonic stem cell.

102. The method of claim 95, wherein the cell is a single-cell embryo.

103. The method of claim 95, wherein the lentiviral vector is a lentiviral transfer
plasmid.

104. The method of claim 95, wherein the lentiviral vector is an infectious lentiviral
particle.

105. The method of claim 95, wherein the ribonucleic acid comprises
complementary regions that self-hybridize to form a short hairpin RNA
targeted to the transcript.

106. A method of reversibly inhibiting or reducing expression of a target transcript
in a cell comprising steps of:
(i) delivering a lentiviral vector to the cell, wherein presence of the
lentiviral vector within the cell results in synthesis of one or more RNAs that
self-hybridize or hybridize with each other to form an shRNA or siRNA that
inhibits expression of the target transcript, wherein the lentiviral vector
comprises a nucleic acid segment located between sites for a site-specific
recombinase, which nucleic acid segment provides a template for transcription
of the one or more RNAs; and
(ii) inducing expression of the site-specific
recombinase within the cell, thereby preventing synthesis of at least one of the
RNAs.

107. The method of claim 105, wherein the cell is a mammalian cell.

108. The method of claim 105, wherein the recombinase is Cre and the sites are
loxP sites.

109. The method of claim 105, wherein the lentiviral vector is a lentiviral transfer
plasmid.

110. The method of claim 105, wherein the lentiviral vector is a lentiviral particle.

111. The method of claim 105, wherein the lentiviral vector provides a template for
synthesis of an RNA comprising complementary portions that hybridize to
form an shRNA.

112. A method for reversibly inhibiting or reducing expression of a transcript in a
mammal in a cell type or tissue-specific manner comprising:
(i) delivering to the mammal a lentiviral vector whose presence within
a cell results in synthesis of one or more RNAs that self-hybridize or hybridize
with each other to form an shRNA or siRNA that inhibits expression of the
target transcript, wherein the lentiviral vector comprises a nucleic acid
segment located between sites for a site-specific recombinase, which nucleic
acid segment provides a template for transcription of the RNA; and
(ii) inducing expression of the site-specific recombinase in a subset of
the cells of the mammal, thereby preventing synthesis of at least one of the
RNAs within the subset of cells.

113. The method of claim 112, wherein the recombinase is Cre and the sites are
loxP sites.

114. The method of claim 112, wherein the lentiviral vector is a lentiviral transfer
plasmid.

115. The method of claim 112, wherein the lentiviral vector is a lentiviral particle.

116. The method of claim 112, wherein the lentiviral vector provides a template for
synthesis of an RNA comprising complementary portions that hybridize to
form an shRNA.

117. A method of treating or preventing infection by an infectious agent, the
method comprising steps of:
administering to a subject prior to, simultaneously with, or after
exposure of the subject to the infectious agent, a composition comprising an
effective amount of a lentiviral vector, wherein presence of the lentiviral
vector in a cell results in synthesis of one or more RNAs that self-hybridize or
hybridize with each other to form an shRNA or siRNA that is targeted to a
transcript produced during infection by the infectious agent, which transcript
is characterized in that reduction in levels of the transcript delays, prevents, or
inhibits one or more aspects of infection by or replication of the infectious
agent.

118. The method of claim 117, wherein the lentiviral vector provides a template for
synthesis of an RNA that comprises complementary portions that hybridize to
form an shRNA.

119. A method of treating or preventing a disease or clinical condition, the method
comprising:
removing a population of cells from a subject at risk of or suffering
from disease or clinical condition;
engineering or manipulating the cells to contain an effective amount of
an siRNA or shRNA targeted to a transcript, which transcript is characterized
in that its degradation delays, prevents, or inhibits one or more aspects of the
disease or clinical condition; and
returning at least a portion of the cells to the subject.

120. The method of claim 119, wherein:
the engineering or manipulating step comprises introducing a lentiviral
vector into the cells, wherein presence of the lentiviral vector in a cell results
in synthesis of one or more RNAs that self-hybridize or hybridize with each
other to form an shRNA or siRNA targeted to the transcript.

121. The method of claim 119, wherein:
the cells comprise stem cells.

122. The method of claim 121, wherein:
the stem cells are peripheral blood stem cells.

123. The method of claim 119, further comprising:
expanding at least a portion of the cells in culture.

124. A kit comprising (a) a lentiviral transfer plasmid comprising a nucleic acid
sequence including (i) a functional packaging signal; (ii) a multiple cloning
site (MCS) into which a heterologous gene may be inserted; and (iii) at least
one additional element selected from the group consisting of: a second MCS,
an HIV FLAP element, a heterologous promoter, a heterologous enhancer, an
expression-enhancing posttranscriptional regulatory element, a target site for a
site-specific recombinase, and a self-inactivating (SIN) LTR; and one or more
of the following items: (b) a packaging mix comprising one or more plasmids
that collectively provide nucleic acid sequences coding for retroviral or
lentiviral Gag and Pol proteins and an envelope protein; (c) cells (e.g., a cell
line) that are permissive for production of lentiviral particles such as 293T
cells; (d) packaging cells, e.g., a cell line that is permissive for production of
lentiviral particles and provides the proteins Gag, Pol, Env, and, optionally,
Rev; (e) cells suitable for use in titering lentiviral particles; a
transfection-enhancing agent such as Lipofectamine; (f) a selection agent such as an
antibiotic, preferably corresponding to an antibiotic resistance gene in the
lentiviral transfer plasmid; (g) instructions for use; (h) a positive control
plasmid.

A 50% confluent 10 cm plate of D7 cells (Bear et al. 2000) was infected with 100 µl
of concentrated pLL3.7 β-catenin lentivirus, which expressed GFP as a transgene
between two loxP sites. Infected cells were sorted based upon expression of
EGFP (A, green line). A 50% confluent 6 cm plate of sorted D7 pLL3.7 β-catenin
cells was infected with adenovirus expressing the Cre recombinase; 1x10^5
infectious units were used in the infection. Cells were expanded for 10 days to
allow for expression of Cre protein, deletion of lox-CMVgfp-lox, and depletion of
EGFP protein pools. Cells were then analyzed by flow cytometry for expression of
EGFP (B, pink line). Cells were also sorted based upon loss of EGFP expression
and expanded. The purple solid peaks in A and B represent the uninfected control.
The percentage of GFP+ cells is shown on each plot. A direct comparison between
pLL3.7-infected D7 cells before (green line) and after (pink line) Cre delivery is
shown in C.

Figure 23